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Preface to the Second Edition 


Imagine the following telephone conversation between a statistician (S) and a research 
scientist (R). R: “Hello, Mr. Stat, I wonder whether you have just a minute for a quick 
statistical question.” S: “Usually I do not do statistical consulting over the phone, but 
let me see if I can help you. What is the problem?” R: “We are developing new 
growth media for industrial producers for growing flower plants. We have three such 
media and we use them with four flower varieties. We have five replications for each 
combination of medium and flower. We have analyzed the data as a 3 x 4 two-way 
classification with five observations per cell. But my graduate assistant has talked to 
one of your students and he is now confused about the validity of this analysis. I just 
want you to confirm that we have done the right thing.” S: “Well, I do not know.” R: 
“What do you mean, you do not know? You are the expert!” S: “I really need to know 
more about how you performed the experiment. For example, how did you prepare the 
media that you used in the individual pots? I assume that you grow the flowers in pots 
in the greenhouse.” R: “Yes，that is right. My graduate assistant simply mixed each 
medium in a big container, which we then put in the individual pots.” S: “That may be 
a problem, because now you may not have any replication,” R: “What do you mean, 
we have no replication? I just told you that we have five replications.” S: “Yes ， but... 
I think it would be best if you would come to my office for me to explain this to you 
and to take a closer look at your experiment.” R: “But we have already submitted the 
paper for publication.” S: “Then why don’t you come when you get the reviews back.” 
Silence. R: “Yes, thank you. I’ll do that.” 

This is, of course, just a fictitious conversation. But many consulting statisticians 
have had similar conversations. The aim of this book is to help statisticians as well as 
research scientists not only to better understand each other but also to obtain a better 
understanding of the intricacies of designing and analyzing experiments It is our hope 
that this can be achieved by using the book as a textbook as well as a reference book. 

Having used the first edition for several years as a textbook in an MS level class for 
statistics students and well-qualified graduate students from other fields relying heavily 
on experimental research, I have gained valuable insight into the needs of both types of 
students. This has led to some changes and enhancements in the second edition without 
giving up the general flavor and philosophy of the book. 

Although some readers may feel the book is too theoretical, we strongly believe 
that these developments are necessary to understand the basic ideas and principles of 
experimental design, to enable students and researchers to pursue ideas of designing 
experiments not covered in this book, and to lay the foundation for even more theo- 
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retical work as covered, for example, in Volume 2: Advanced Experimental Design. 
For the non-statistics student it is not always necessary to understand all the details 
leading to important results as long as they develop a certain feel for these results and 
appreciate their role and importance in the overall picture. A skillful teacher will be 
able to accomplish these aims without compromising the rigor of the development of 
the material. 

Having said this, I have tried to make the second edition more user-friendly by also 
emphasizing the practical aspects of designing and analyzing experiments. I have con¬ 
siderably expanded Chapter 2 with further discussion of the planning aspects of setting 
up an experiment and giving heuristic arguments why the various steps are so impor¬ 
tant for a successful experiment. This should appeal to both consulting statisticians and 
research scientists. 

Other major changes involve the development in Chapter 9, which I consider to be 
one of the most important chapters in the book because of the introduction of the notion 
of blocking. I spend a great deal of time discussing the different types of blocking 
factors and their importance in the overall scheme of setting up, analyzing, and drawing 
inference using various forms of block designs. These ideas are then carried over to 
Chapter 11, which introduces the basic concepts of factorial treatment structure and 
design. I have included a case study, based on an invited presentation at a meeting 
of the American Society for Horticultural Science, which discusses in some detail the 
role, analysis, and interpretation of various forms of interactions. 

The discussion about repeated measures has been moved to a separate chapter 
(Chapter 14) to give it more emphasis, as this type of experimentation occurs quite 
often. I explain how repeated measures can be paired with any error-control design, 
how this leads to a split-plot type structure of the experiment, and how and why the 
analysis differs from that of a split-plot experiment. 

Finally, I have added to most chapters numerical examples using the statistical 
software package SAS® (SAS Institute, Inc. 2002-2003)as a tool to analyze the data. 
This should not considered to be a tutorial in SAS, but it should provide some help 
to readers of this book about how to analyze similar data from their experiments and 
to relate such analyses to the developments, in particular ANOVA tables, given in the 
book. In order to preserve space I have omitted some information provided in the usual 
SAS output. Also, I should mention that the data are not real, even though some of the 
experiments described are, as research scientists are generally not willing to share their 
original raw data. The results presented should, therefore, not be interpreted as findings 
in a given subject matter area, but rather as illustrations of statistical procedures useful 
for analyzing such data. For readers who do not have access to SAS or who prefer to 
use other statistical software, the examples should provide some help in setting up the 
analyses in their environment. In addition to using SAS as a tool for the analysis from 
designed experiments I also show how SAS can be used for randomization procedures 
and for constructing certain types of factorial designs. 

I hope that the changes and enhancements in the second edition will prove useful 
to students, teachers, and researchers. For those who seek a deeper understanding and 
further developments of the material presented here I provide references to chapters 
and sections in Volume 2 indicated, for short, by II.xx or II.xx.yy ， respectively. 

An FTP (ftp://ftp.wiley.com/public/sci_tech_med/design_experiments/) for this book 
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will be maintained by wiley.com，which will also contain additional exercises and so¬ 
lutions to selected exercises. 

During the process of thinking about and completing this revision I have received 
help from several people. I would like to thank my students and colleagues for pointing 
out errors in the first edition and for making suggestions for changes. I am grateful to 
Yoon Kim and Ayca Ozol-Godfrey for their help with some computational and graph¬ 
ical aspects in Chapters 6 and 11. It is difficult to find the right words to express my 
profound gratitude to Linda Breeding for her tireless and skillful efforts in producing 
the camera-ready manuscript. This has been a monumental and difficult job, and even 
during times of despair she found a way to carry on until the work was completed. No¬ 
body could have done it better. Thank you, Linda. I also would like to thank Jonathan 
Duggins, Amy Hendrickson, and Scotland Leman for their expert advice and help with 
LaTeX. 

Finally, I would like to dedicate this edition to the memory of my co-author and 
mentor, Oscar Kempthorne, for his many important contributions to the philosophy, 
theory, practice, and teaching of experimental design (for a bibliography, see Hinkel- 
mann, 2001). 


Klaus Hinkelmann 
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Preface to the First Edition 


The subject of the design of experiments has been built up largely by two men ， R. A. 
Fisher and F Yates. The contributions of R. A. Fisher to mathematical statistics form 
a major portion of the subject as we now know it. His contributions to the logic of 
the scientific method and of experimentation are no less outstanding, and his book The 
Design of Experiments will be a classic of statistical literature. The contributions of 
F. Yates to the field of the design of experiments are such that nearly all the complex 
designs of value were first put forward by him in a series of papers since 1932. Both 
Fisher and Yates have also made indirect contributions through the staff of the statistical 
department of Rothamsted Experimental Station, since its founding in 1920. It is not 
surprising that the contributions originated from Rothamsted, because Rothamsted was 
probably the first place in the world to incorporate a statistical department as a regular 
part of its research staff, and the design of experiments is a subject that must grow 
through stimulation by the needs of the experimental sciences. 

This quotation from the preface of Design and Analysis of Experimen ts by Kemp- 
thorne (1952) affirms our recognition of the enormous and path-breaking contributions 
made by these two men to the field of experimental design and experimentation in 
general. Even though most of their ideas originated in connection with agricultural or 
genetic experiments, the resulting principles and designs have found wide applicability 
in all areas of scientific investigations as well as in many areas of industrial production 
and development. 

Because of the widespread use and increasing importance of experimental design, 
it is essential that students and users obtain a firm understanding of the philosophical 
basis and of the principles of experimental design as well as a broad knowledge of 
available designs together with their assumptions, their construction, their applicability, 
and their analysis. These topics then are the subject of this book which will appear in 
two volumes. 

Volume I is a general introduction to the subject laying the foundation for the devel¬ 
opment of various aspects of experimental design. In it we describe and discuss many 
of the commonly used designs and their analyses. We return to some of these designs 
and introduce other designs in Volume II at a more technical and mathematically more 
advanced level. 

With respect to the present volume, Chapters 1 through 5 describe in some detail 
the philosophical foundation and the mathematical-statistical framework for our ap¬ 
proach to the discussion of experimental design. We put the notion of and the necessity 
for intervention studies, the main topic of this book, squarely into the context of the 
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scientific method. We develop and draw a sharp distinction between observational and 
intervention studies, a theme to which we return at various places throughout the book, 
in particular in connection with the analysis of data. Much of the analysis is based on 
the theory of linear models. A thorough discussion of linear models theory is given in 
Chapter 4. Our major aim here is to provide the reader with the basic tools to under¬ 
stand and develop the analysis of data from intervention studies of the sort discussed 
in this book. 

Although linear models play a fundamental role we stress the fact that they do 
not exist in and of themselves but that they evolve from very basic principles and in 
the context of the experimental situation at hand. Indeed, in Chapter 2 we argue that 
many facets are involved in advancing from a research idea or question to a designed 
experiment which permits the investigator to draw valid conclusions. Some of these 
facets are of a statistical nature such as developing an appropriate experimental design, 
developing an appropriate model, and carrying out an appropriate analysis, and they 
are the subject of this book. But it is important, we assert, to always keep in mind that 
statisticians and subject-matter scientists have to combine their knowledge to develop 
an experimental protocol according to sound principles of both fields. 

We have alluded earlier to the impact that R. A. Fisher had on the development 
of the field of experimental design. One of his contributions concerning the design of 
experiments is the use of randomization. In Chapter 5, as well as in following chapters, 
we discuss the general idea and then apply it to specific designs. It forms the basis of 
the analysis for all intervention studies. 

Beginning with Chapter 6 we develop from first principles various error-control de¬ 
signs. We start with the completely randomized design (Chapter 6) as the simplest form 
of error-control design and then move on to more complex error-control designs such 
as randomized block designs (Chapter 9), Latin square type designs (Chapter 10) and 
split-plot type designs (Chapter 13). For each design we derive linear models and the 
associated analyses, mainly in the form of analyses of variance. Other forms of analy¬ 
sis such as estimating and testing treatment contrasts are dealt with in Chapter 7. And 
further reduction of experimental error through the use of supplementary information 
is described in Chapter 8. 

The notion of treatment design is introduced in Chapter 11 when we discuss facto¬ 
rial experiments. Particular attention is paid to experiments involving factors with two 
and three levels. This serves as an introduction to the vast opportunities and techniques 
that are available for such type of experimentation. We emphasize in particular how 
treatment designs can be combined with or embedded in error-control designs in the 
form of systems of confounding. 

In Chapter 12 we touch briefly on a different form of experiment designs: response 
surface and mixture designs. It serves mainly to point out the difference between com¬ 
parative and absolute experiments, but it also serves to show how error-control designs 
and treatment designs can be applied towards the construction of response surface de¬ 
signs. 

Although many experiments can be conducted using the designs discussed here and 
in Volume II, there are many others for which special designs need to be constructed. 
It is our aim here to lay the foundation for such work by discussing in detail the major 
principles of experimental design, such as randomization, blocking (in particular in- 
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complete blocks), the Latin square principle, the split-unit principle, and the notion of 
factorial treatment structure. 

The relationship of the present two volumes to the book Design and Analysis of 
Experiments by O. Kempthorne published in 1952 merits some discussion. Very much 
is common. We have felt it absolutely necessary to add a long chapter on the pro¬ 
cess of science, discussing our perception of observation theory in science, the role of 
experiments, the role of data analysis, and the introduction of ideas of probability as 
related to relative frequency in a defined population of repetitions. The presentation 
of least squares and the general linear hypothesis needed large improvement. We have 
based most of the data analysis and inference on randomization analysis, expanding 
and formalizing the presentation. The remainder of the presentation in the present two 
volumes is a considerable expansion and rearrangement of standard material of the 
1952 book taking many of the developments during the last 40 years into account. 

The organization and presentation of the material has evolved over a number of 
years of teaching the subject to graduate students in statistics. Volume I is intended 
as a textbook for a one-semester course for first year graduate students. To make the 
course effective, the students should have been exposed to a fairly rigorous course 
in statistical methods. They should be familiar with the basic principles of statistical 
inference and with the rudimentary ideas of analysis of variance and regression, i.e .， 
they should have some understanding of and appreciation for linear models and their 
role in statistical inference. 

The book contains more material than can be taught reasonably in one semester, and 
hence a selection of topics will have to be made. This will depend to some extent on the 
students’ background and preparation. One suggestion is to skip some details and omit 
certain parts in individual chapters, Another is to omit much of Chapter 4 and Chapter 7 
and omit all of Chapter 12 (at Virginia Tech, for example, there exists a concurrent 
course in the theory of linear models covering the material in Chapter 4, most of the 
material in Chapter 7 will have been covered in a course on applied statistics, and 
there exists a separate course for response surface designs). At any rate, Chapters 6 
through 13 are fairly self-contained except that frequent reference is made to results in 
Chapter 4 for a better understanding of the underlying principles. 

The reader will notice that no numerical examples are given throughout the text. 
It is assumed that the reader is familiar with mathematical notation and does not have 
any difficulty with reading and handling formulas. There is no emphasis at all on 
calculations. Instead, we provide some guidance on how to use available statistical 
software. This attempt is, however, rather limited in that we refer only to SAS as an 
example of available software packages，and even that is by necessity not complete. 

A thorough knowledge of the material in Volume I is a prerequisite for understand¬ 
ing Volume II. As mentioned before, the presentation of the material in Volume II is 
more technical and hence suited for a more advanced course in experimental design. 
It contains more than enough material for another one-semester course. In addition, 
Volume II is intended to serve as a reference book on many topics in experimental 
design. 

Many people have contributed to this book in different ways. Foremost among them 
are our students who have been exposed to this material over the years. We thank them 
for their questions and comments. K. H. would like to thank Virginia Tech for granting 
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him study-research leaves and the Departments of Statistics at the University of New 
South Wales and Iowa State University for providing him with support, facilities, and a 
congenial atmosphere in which to work. O. K. is grateful to Iowa State University for 
providing more than 40 years of stimulating environment and association with many 
graduate students of high ability. We thank Yoon Kim and Sungsue Rheem for help 
with the simulations for the randomization analyses, and Markus Hlittmann and Sandra 
Schlafke for extensive help with the preparation of the index. And finally, we express 
our deep appreciation and gratitude to Linda Breeding and Ginger Wenzlik for their 
expert typing and word-processing. 

Klaus Hinkelmann 

Oscar Kempthorne 



CHAPTER 1 


The Processes of Science 


1.1 INTRODUCTION 

In order to understand the role of statistics ， generally, and the role of design of experi¬ 
ments in particular, it is useful to attempt to characterize the processes of science and 
technology. All science and technology starts with questions or problems. The grand 
aim is to develop a model which will describe adequately, that is, accurately, the past, 
present, and future of the universe. Obviously, if we are to describe the future, we must 
have a model that incorporates development over time — that is, a dynamic model, and 
a model that predicts what alterations will be brought about by interventional acts, such 
as drug therapy, reducing money supply, or supplying a nation with armaments, to give 
widely disparate examples. 

1.1.1 Observations in Science 

The foundation of all science is, obviously, observation. This, which we all do every 
waking minute of our lives, would seem to be a very simple matter, with a logic that 
is entirely clear. It is not clear from several points of view. Curiously enough, it 
is not discussed, it seems by philosophers of knowledge. It is obvious that animals 
make observations — all one has to do is to try to catch a rabbit. It observes that it 
is being chased and takes evasive action. This is, presumably, an instinct bred into 
rabbits by the evolutionary process. In science, a reaction to a portion of the world is 
an observation only if that reaction can be recorded, perhaps only in memory, or better, 
of course, by actual physical recording. To do this requires a language and descriptive 
terms. It is necessary that an observation can be described in terms that have some 
meaning to others. The development of a language for this purpose, a language that is 
effective, is a process of science that continues. We need only look at the development 
of the language of biology. This field is full of names of things, and indeed, one of 
the great difficulties of the field is to learn the naming that has been developed in the 
past, a task that becomes more and more difficult as processes of observation are being 
developed, one can almost say, day by day. Many parts of the journal Science of today 
are unreadable except by experts and would be unreadable for the experts of decades 
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ago. The development of this type of descriptive language proceeds with care, and 
with the discipline of the area of study. A descriptive term does not receive validation 
until it is agreed on and can be confirmed by any observer who follows the prescribed 
protocol of observation and has been educated in the use of the descriptive terms. This 
is no more than a cliche in physical and biological science and one might be led to the 
view that it is not worth stating. But when we turn to any aspect of human mental status 
or mental behavior, the “obvious” cliche becomes critical. One merely has to look at 
the nosology that occurs in psychiatry to see the problem. This is not to imply that 
workers in that area are dolts — the area is remarkably difficult because of the problem 
of validation of observation. 

A second point about observation is that it is by its very nature incomplete. One 
observes, one says, a robin outside one’s window. Humanity uses this mode of expres¬ 
sion and it has served it well. But one does not observe the whole of the phenomenon. 
Just recall the commonplace interchange. Person A says “I see a robin.” Person B says, 
“Yes, I see it too. Did you notice that it has a gray bar on its wing tips?” Person A says, 
“No, I did not notice that, but now that you mention it, I do see it.” Person A’s observa¬ 
tion was incomplete relative to B’s observation. Obviously, there can be person C, who 
sees more. Also, obviously, observation is not an innate ability; it is one which may 
require high “professional” training — even in areas that use no more than the ordinary 
unaided human eye. For the naturalist of the sixteenth century, for the ordinary citizen 
naturalist, and for the person who has received two years of training, observation is not 
at all the same. If we adjoin the obvious massive development of observational pro¬ 
cesses, with physical devices, for instance, observing in infrared light, observing with 
an electron microscope, and so on, there is not an elemental activity which we can call 
“making observation.” 

Another aspect, which is much more subtle, is that the process of observation may 
or may not have an effect on what is being observed, with the elementary consequence 
that one simply cannot observe the status of an object of observation. There are, of 
course, elementary techniques for combating this, as in the use of blinds to observe 
birds, or of walls that have one-way vision. But when one considers observing, or 
trying to observe, the mental state of a human and leaving aside the possibility of ob¬ 
serving what one thinks to be physical correlates of mental status, one has to talk to 
the human and ask questions and then it is not at all clear what the status of ensuing 
conversation by the human being observed is. Turning aside from an obviously fantas¬ 
tically difficult area, we saw a revolution in physics at the beginning of the twentieth 
century with the realization that one could not look at a particle except by shooting 
another particle at it and getting a collision. This type of situation led to the famous 
Heisenberg uncertainty principle in an area for which it was thought previously that 
one could observe without interfering with the object observed. This phenomenon has 
huge consequences as any modern physicist knows. It has, also, huge consequences 
with respect to epistemology. 

1.1.2 Two Types of Observations 

We leave this discussion. For the purposes of our discussion here, we assume that there 
is a validated process of observation that has no effect on the object being observed. We 



1.1. INTRODUCTION 


3 


must, however, discuss a major point. There are fundamentally, it seems, two types of 
observation. The first consists of placing an observation of an object of observation in a 
class: for instance, the flower being observed is pink or has pinnate leaves. In most cir¬ 
cumstances, there is no doubt of the recorded observation (though one can be doubtful, 
e.g., on a color designation). In other cases, the result of the observation is uncertain; 
we merely have to imagine being given sequentially with repetition unknown to the ob¬ 
server of a set of colored blocks that do not have strongly distinguishable colors. One 
will find that one’s observation of a block will give different results in repetitions, over 
which one is fairly sure that color has not changed. In such cases, one has no recourse 
but to use a probability model to the effect that, in repetitions that are unconnected in a 
known way, there will be a frequency distribution of observational outcomes. We shall 
not be concerned with this at present. The second type of observation, which perme¬ 
ates quantitative science, is the measurement of a numerical magnitude, for example, 
the weight of a piece of rock, which one is confident does not change. In this case, 
there is always an error and an imprecision of measurement. The nature of the error 
and of the imprecision is again representable by some frequency distribution of results. 
This type of problem permeates, of course, the physical sciences, and increasingly so, 
as the sought after observation, such as weight, becomes smaller and smaller. 

We hope that we have given a useful discussion of observation, though elementary 
and potentially highly obscure at a philosophical level. We take comfort in the fact that 
even if the process of observation is quite unclear (as it is at a fundamental level), the 
world of science is permeated with interpersonally validated observation. 


1.1.3 From Observation to Law 

Our writing here is aimed at constructing a useful model of what happens in science. It 
seems clear that the beginning of science is observation and description. It also seems 
clear that this is still a critical feature of science. This observation process requires no 
theory. It is interesting in this connection, to look at what Darwin (1809-1882) did in 
the voyage of the Beagle. He was, for his age, a very remarkable observer. The point 
of expressing the above views is that it is sometimes said that the mere collection of 
observations has to be based on a theory, a concept to which we shall turn. If one wishes 
to state that even the simplest observation is based on the informal theory that one’s 
observation process obtained an attribute of what is observed and not of the observer, 
one cannot object. Apart from this, much observation has been generated not by any 
theory but by curiosity, an attribute that one observes in animals. It is true, of course, 
that, very often, observation is initiated by a question or by a problem situation. One 
could say that curiosity is the result of a question, but this seems to be mere playing of a 
verbal game. Obviously, our presentation is Baconian and we quote his first Aphorism 
(Spedding et al., Vol. I, 1861, p. 241): 

Homo, Naturae minister et interpres, tantum facit et intelligit quantum de Nature 

ordine re vel mente observaverit, nec amplius scit aut potest. 

for in its translation (Spedding et al., Vol. VIII, 1863, p. 67): 


Man, being the servant and interpreter of Nature, can do and understand so much 
and so much only as he has observed in fact or thought of the course of nature: 
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beyond this he neither knows anything nor can do anything. 

However, we cannot accept the full Baconian prescription as given by his succes¬ 
sive aphorisms. 

It is obvious that science does not consist merely of a collection of, shall we say, 
interpersonally validated observations. What comes next? Rather clearly, it is the 
organization of such observations — let us call them facts, into sets of related facts. Let 
us suppose that our observations are categorical. We observe trees. This consists of 
noting, with our developed language, that there are trees that keep their leaves through 
the winter and those that do not. We deliberately take simple examples that even the 
proverbial man-in-the-street can appreciate. So this part of our observation places the 
objects of observation into one of an exhaustible polytomy. Let us call the classes 
of one polytomy ai, a 2 , … ， a r and of the second, H … We look at our 
observations and we see that in our observations every object which was 0:3 is /? 2 . 
Obviously, a generalization is suggested: every object that is as is We have a 
suggestion of a “law.” The word “law” is used in our language in many senses and 
even in science it is used in at least two senses. A “law” states that something must 
occur. The Creator has decreed so. This is one sense. Another sense, which is really 
quite different, is that a law is an empirical generalization. A hoary and false example 
is: I have seen 10 swans and they were all white. So I infer (falsely) the “law” that 
all swans are white. This example leads, of course, into the problem of induction, on 
which libraries of books have been written, without resolution. 

We then see a very curious thing happening, the development of a theory. From the 
“law” obtained as a suggested empirical generalization, we convert our generalization 
into a “law” of Nature, something that must necessarily be the case. When we do this, 
we are beginning to make a theory. This is, however, just one part of the construction 
of a theory. It is the absence of a role for theory that has been the main criticism of the 
prescription of Bacon (1560-1626). 

It is informative, here, to bring in the work of Kepler (1571-1630). The observa¬ 
tions were the positions of planets at different times of the year. The contribution of 
Kepler was to analyze the data and to show that the path of each planet was an ellipse, 
with the sun as focus, and other aspects that are given in his famous three laws. It is 
also informative to recall the work of Mendel (1822-1884) in biology. The crossing of 
types X and Y gave offspring of type Z, say, and then the crossing of these offspring 
gave an array of offspring which had the appearance that \ were X, \ were Z, and 
\ were Y. Interestingly, this appears to be the first case of occurrence of a validated 
probability model in science (apart from mere gambling). 

In the one case, we have Kepler’s laws and, in the other case, Mendel’s laws. Now, 
we have to raise the hard question. Are these laws as empirical generalizations or are 
these laws that tell us what must happen? Are they built into Nature by the Creators? 
Our answer is obvious, we think: they are merely empirical generalizations. 

How then do we get to a theory? The process is rather simple, though rarely ex- 
posited in our experience. At first, we have what may be called “naive” laws, merely 
empirical generalizations. But we want an explanation. We are in a morass and we 
shall have to discuss the idea of “explanation.” 
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1.2 DEVELOPMENT OF THEORY 

The suggestion of Bacon that all we need to do is make observations has been rather 
uniformly criticized in the ensuing four centuries or so. We suggest that the criticism 
has not been entirely justified. One question is: what observations should one make? 
Obviously, we could suggest that the way to understand the universe is to have millions 
of humans observing — observing, only observing. It is obvious, indeed, that such a 
program would lead to an incredible mass of observational facts. The first missing 
ingredient in the Baconian prescription is given by the question: “What are we to 
do with all the facts that are obtained?” This question is interesting to the field of 
statistics, and eventually, to the whole of science, because it tells us that we have to 
do “data analysis.” It is interesting and curious that this is a term used to denote an 
activity that has been pursued by humans from the beginning of time, but which has 
been popularized since the 1960s in all discussions of statistics. 

It is surely pretentious to think that one can encapsulate the efforts of humanity to 
understand the universe in a few printed pages. But it is useful, we think, to attempt to 
present a broad picture that captures essential features of the human efforts that have 
been made. Interestingly, this is, of course, a problem of data analysis of its own. We 
can look over the history of science. None of us can do this completely, but perhaps 
we can see a general pattern which is not misleading. 

1.2.1 The Basic Syllogism 

The beginning is surely observation of entities that have some degree of permanence 
in time, usually entities that one can, so to speak “hold in one’s hand,” literally or 
metaphorically. These are looked at and classified. This is just Aristotelian classifica¬ 
tion. From this came laws as empirical generalizations such as “All A are B，” or “All 
entities which have attributes a and (5 have attribute 7 . s, It is interesting that this led to 
the basic syllogism: (i) All entities with a have 3; (ii) entity E has q; therefore, (iii) 
entity E has ,5. We find this presented as a mode of deduction, and this matter needs 
discussion. This syllogism is used widely and essentially in mathematical reasoning ， 
and without it, the possibility of the sort of mathematics we do would be impossible. 
For instance, every triangle has the property that the sum of its interior angles is 180°: 
here is a triangle; therefore, the sum of its interior angles is 180°. From where do 
we get the first part of the syllogism? The answer is simple. We prove it! But then 
we have to ask: “What do you mean by that?” What does it mean to say: “We prove 
it ”？ The answer is given in simple and not misleading form. We have defined t4 tri- 
angle.” We have developed modes of deduction that we accept as constituting proof. 
This whole process is very subtle. Just how subtle it is can be seen from the develop¬ 
ments of mathematics of the past two centuries or so. We see students in high school 
trying to write proofs in geometry. We see ourselves writing proofs that we judge to 
be complete. But we later see that our proofs are incorrect or incomplete. We see in 
the history of mathematics incorrect proofs by great mathematicians. We see proofs in 
which a questionable syllogism has been used with total unawareness that it has been 
used. The curious outcome of this phenomenon is that a proof of a mathematical the¬ 
orem is a sequence of statements, in mathematical form, developed from axioms that 
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are unquestionable, that the world of mathematics accepts as constituting proof. The 
purported proofs have been examined by thousands of mathematicians and found to be 
convincing. This should not be taken to be derogative and pejorative. The world of 
non-mathematicians should know that there is considerable controversy at the founda¬ 
tions of mathematics, a controversy that has arisen only in the past century or so. What 
are the axioms that we are to accept as indubitable? 

Our interest in the basic syllogism is not with respect to its use in mathematics, 
but its use in describing and explaining the real world — the world we can observe. It 
seems entirely clear that the use of the syllogism in this context is totally questionable. 
It is questionable from the point that it is empty. If we know that all A’s are B, and 
we know that X is an A, we are allowed to deduce “Therefore X is B.” But this is an 
empty deduction, because with respect to the real observable world, we cannot use as 
a premise that all A’s are B without having assured ourselves that X, which is an A, is 
B. We read texts on logic and we find the standard example: 

All men are mortal. 

Socrates is a man. 

Therefore, Socrates is mortal. 

Can we use this in deduction about the real world? The problem is, of course, the 
validity of the first statement — the premise. As we have said, by accepting that all men 
are mortal, that Socrates is a man, we have accepted the so-called conclusion. From 
one point of view, we are just playing a word game, and are hoping to impress our 
reader by using the very heavy word “therefore.” A curious example was given in the 
popular press recently. Consider the sequence of statements: 

All babies are nurtured in a uterus before birth. 

Individual X is a baby. 

Therefore X was nurtured in a uterus. 

The point of the example is that the uterus of a mother had been removed some years 
before. 

Surely one cannot say that the basic syllogism is nonsense. Without it, no science 
would be possible. What then is going on? The answer to this question is very simple 
in form. If the premise is to be useful, it must be established independently of the 
individual use of it; in other words, we must have the knowledge that all men are 
mortal, without having observed that Socrates is mortal. The upshot is then obvious; 
to establish the premise in such a way as to be useful in the syllogism, we must use 
induction. We can do no more than say: We have examined many humans and found 
that everyone of them was mortal. So we induce that mortality is a universal property 
of the class “humans.” Then if we see that Socrates is in the class “humans，” it must be 
the case that Socrates is mortal. 


1.2.2 Induction ， Deduction，and Hypothesis 

The punch line of this stream of thinking is that the use of the basic syllogism as a tool 
of science is based on induction. Or, with perhaps a harsh mode of statement, deduction 
as applied to the real world is totally ineffective without the establishing of the premise 
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by induction. Bacon in his Aphorism XIV stated (Spedding et al., Vol. VIII, 1863, 
p. 70): “Our only hope therefore lies in a true induction.” So we have to try to say 
something reasonable about induction. This is a very difficult task; we merely have to 
read the many volumes on the topic. To exemplify the difficulties, we quote Bertrand 
Russell (1959): 

But there is a much more general problem involved here, which has continued 
to bedevil logicians to the present day. The difficulty is, roughly, that somehow 
people feel induction is not after all as respectable as it ought to be. Therefore it 
must be justified. But this would seem to lead to an insidious dilemma that is not 
always recognized. For justification is a matter of deductive logic. It cannot itself 
be inductive if induction is what must be justified. As for deduction itself, this 
no one feels compelled to justify, it has been respectable from time immemorial. 
Perhaps the only way is to let induction be different without seeking to tie it to 
deductive apologies. 

This statement deserves comment. It tells us, clearly, that Russell (1872-1970)，surely 
one of the ten or so finest minds of the twentieth century, does not help us. It tells us 
that Russell cannot help us with the problem of understanding and carrying on science, 
because it is obvious, we assert, that one of the primary bases of science is induction. 

The foremost philosopher of science, perhaps, was C. S. Peirce (1839-1914) (see 
Gallie ， 1966). We cannot, here, give our detailed understanding (which may be falla¬ 
cious) of his ideas. Peirce distinguished three types of inference: deduction, induction, 
and hypothesis. The third he preferred to call “abduction” ， which, it seems, is a method 
of testing rather than of developing knowledge. Workers in statistics will have no dif¬ 
ficulty in appreciating this third type: a considerable portion of statistical theory and 
practice is the testing of statistical models. This, of course, entertains the possibility 
that a model or a theory can be shown to be false. The essential feature of science is 
that its theories can be “falsified.” This view is supported strongly, it appears, by the 
philosopher of science, Karl Popper (1902-1994). The only problem we see with his 
writings is an absence of an approach to methods of falsifying hypotheses or models. 
If we have a universal, “All A’s are B，’’ how are we to falsify it? Rather obviously, the 
only thing we can do is to continue to examine the A’s that we meet and see if they are 
B. A single occurrence of A and not B falsifies the universal. Suppose, however, that 
we have observed 100 A’s and find that they are all B. Does this justify the universal? 
Obviously not. It does, of course, suggest it. Can we quantify strength of support for 
the universal? Obviously, we should feel more confident if we found the occurrence 
with 100 A’s rather than 10 A’s. We shall not pursue this discussion except to state our 
view that this problem can be addressed only by making an assumption of randomness, 
which must be questioned, followed by tests of significance and tests of hypotheses. 
These are hypothesis “falsification” procedures. If our hypothesis is that an unknown, 
which is a constant in a theory takes a certain value or lies in a certain range we shall 
again use statistical tests, and associated statistical intervals. 

No theory put forward so far, even in physics, the so-called “Queen of Science ’，， 
has withstood the test of falsification. We do not bother to substantiate this; we merely 
advise the reader of this exposition to look over the sequence of theories. A facet of 
this must be discussed. Even though a theory, for example, the theory of gravitation 
or the theory of electricity and magnetism of some past period, has been shown to be 
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false, that theory maybe excellent for predicting a wide variety of outcomes of circum¬ 
stances. Our ordinary living is based, with the use of electricity, on what may be called 
the classical theory of electricity and magnetism, and in such application this theory 
is excellent, and obviously so. This tells us something that is highly significant. We 
cannot talk about a theory being absolutely true. We can only talk about a theory being 
true in a given context of application. Obviously, we have many such theories which 
we use every day, in devising, for instance, the “gadgets,” heating systems, cooling 
systems, transportation systems, sending man to the moon, etc., and in the nutritional 
“theories” that we use for plant, animal and human nutrition, etc., and the medical the¬ 
ories we use, such as those to cure deadly diseases, such as syphilis or gonorrhea, etc., 
or to palliate chronic diseases, such as diabetes. 

We do not have the time or the ability to pursue this line of thought. However, it 
leaves us with the view, that we hold rather firmly that the question of whether a theory 
is true, unconditionally, is not a well-formed question. We have to ask if the application 
of a given theory to a specified set of circumstances gives a prediction that is verified 
to be correct. 

1.3 THE NATURE AND ROLE OF THEORY 
IN SCIENCE 

We read writings, which we shall not cite, which take the position that there cannot be 
science without scientific theory. We may mention, however, the writings of Poincare 
(1854-1912) and the writings of Popper as indicating at least a strong tendency toward 
this view. We shall first exposit our opinion that this view is wrong. It is wrong for the 
very simple reason that there are varieties of science. 

It is absurd for any writer to claim that he or she can classify science into well- 
defined disjoint activities. However, any writer who pretends, that is, claims, to write 
about science in the broad sense must make an attempt and must recognize that there 
are, indeed, partially disjoint activities. 

1.3.1 What Is Science? 

A century ago, with some exceptions, some of which we shall mention, science was 
thought to consist of physics. Perhaps chemistry could be admitted to the domain of 
science, if only because much of it is based on physics. This view, we believe, persisted 
and still persists in the writings of philosophers of science. We shall not attempt to 
give our basis for this perception. What were the exceptions? Rather obviously, one 
had to admit that biology is a branch of science. As regards agriculture, it is obvious, 
being ironical, that is a problem for farmers, not for science. But should one admit 
psychology, sociology, economics (the so-called “dismal science”)，political science, 
demography (and all related problems, such as nature and amount of employment, cost 
of living, etc.), education, child development, ecology, traffic, and so on to the august 
realm of science? Our view is that we should do so. Furthermore, any exposition 
of philosophy of science that does not give this status to the areas of investigation 
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mentioned and to others that could be listed, should be regarded as being so defective 
as not to merit deep acceptance. 

It is useful, perhaps, to give some perceptions on the origin and history of the 
limitation of science in the way indicated. One has to go back to the Greeks, for whom 
science is what one knows, or science is what is true. This leads, of course, to the 
question of “What is truth?” This question has, obviously plagued humanity “since 
time began,” and it is clearly impossible to address this question in all its depths, for 
many reasons, including that of competence of the writers. 

To cut a long story short, and, hopefully, not to do rank injustice to the thinkers of 
the past, the basic idea of proving something, that is, proving a proposition to be true, 
is to take as true certain axioms and then deduce the proposition from those axioms by 
Aristotelian logic. An early formulation of this was the process of Descartes (1596- 
1650), whose prescription was to subject every proposition to extreme doubt. As a 
result of this process, one would reach certain propositions that cannot be doubted. 
One would, then, have a basis for a deductive argument. The problem with this pre¬ 
scription is, obviously, that a process of extreme doubting will lead to nothing certain. 
For Descartes, the first unquestionable proposition was “Cogito ergo sum” 一 “I think, 
therefore I am.” Whether we can accept this translation with the present-day meanings 
of words is not at all clear. However, it is surely the case that this, as a basic proposition, 
has been questioned severely over the centuries, most recently by Sartre (1905-1980). 
The whole history of philosophy since Descartes has been very tangled. Certainly, 
highly significant thinkers were the so-called British empiricists: Locke (1632-1704), 
and, especially, Hume (1711—1776) ， who, it seems, was the first to pose the problem 
of induction. If we continue the development, we come to Kant (1724-1804), who 
had two highly significant ideas. One is that behind the world of phenomena there is 
a world of noumena, about which we can know nothing. If this is a correct, even if 
brutally short, characterization, the idea is remarkably modem. A second Kantian idea 
is that there are two types of truth: a priori analytic truth, which is true by virtue of 
language, e.g.，“I am the father of my son，” and a priori synthetic truth. There can be 
no doubt about the first within a language, it would seem. The word “synthetic” means 
“about the real world.” The question then is simply: Are there any a priori synthetic 
truths? One may suggest that there are indeed none. The one such that Kant accepted 
is “Every event has a cause.” This leads us into the meaning of cause and causality, 
which we shall take up later. 

1.3.2 Two Types of Science 

The point toward which we are directing the previous discussion is absurdly simple. 
There are two types of science. The first type is descriptive science, in which man looks 
at the universe and describes what he sees. This surely characterizes the biology work 
of Aristotle (384-322 B.C.), the naturalist work of Charles Darwin (1809-1882), all the 
description of the biological world that we see in a good basic college text on biology, 
and so on. It is essential to realize that all description is incomplete. We may quote the 
Existential aphorism: “Essence is the totality of appearances” which we translate to 
mean: the real nature of an entity that is being observed is given only by “all possible 
ways of looking at it.” Obviously, we never reach this end and will never do so, because, 
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even with today’s observational techniques and apparatuses, the task is impossible, and 
also, critically, a highly significant part of science is the development of new ways of 
looking at things, e.g.，via the electron microscope, by the quite remarkable techniques 
of “chopping up” chromosomes and determining DNA sequences, to give just two, now 
common, but very new (in the history of science) observation techniques. 

The second part of science is the development of theory. This forces us to give 
a picture (our picture, of course) of the nature of theory in science. Our perception 
of this, as a concept of theory that was used before modern physics and quantum me¬ 
chanics, is as follows. One observed the world by certain observational procedures. 
These procedures possessed two critical properties. The first is that the observations 
made by one trained observer were essentially the same as those made by any other 
trained observer. If, in fact, two observers do not obtain essentially the same resultant 
observations, then the actual process of observation used by each has to be examined. 
If two observers appear to be following the same protocol of measurement and they get 
different results, then we conclude that the specification of protocol of measurement is 
incomplete and is susceptible to different implementation by different observers. This 
is, of course, a frequently traveled path of investigation. If a protocol of measurement 
cannot be specified so that two trained observers cannot obtain essentially the same 
observation by following the written protocol of measurement, then the measurement 
process is not well-defined and needs further specification. We have used the phrase 
“essentially the same,” We have to include this because much observation consists of 
placing an observed unit into a category or of attaching a numerical magnitude to the 
unit being observed. In the former case, it may be that the placing in a category is not 
entirely reproducible between observers, or even between repeated observations of a 
unit that is judged on other evidence not to have changed. A simple example of this is 
observation, say, of a mouse recorded on a film as being normally active, hyperactive 
or hypoactive; another is classification of individuals who are “mentally ill” as being 
“organic, psychotic or characterological.” Clearly, we are unable to describe all the 
problems in this area, or even indicate, even superficially what they are，except to give 
our perception of reports in this area, which is that psychiatric diagnoses are unreliable 
in terms of agreement of independently acting observers. In giving this, we do not 
intend to be pejorative: the problems are incredibly difficult, much more so than ob¬ 
serving growth of a plant, the endocrinology of an ant, or the behavior of an atom that 
has been hit by a particular type of particle. In the latter case mentioned, that of attach¬ 
ing a numerical magnitude to an object of observation, it is always the case that there 
is error of measurement，either of inexplicable variability of result of measuring an ob¬ 
ject that does not vary (according to all we know), or of measuring to a prespecified 
degree of “tolerance,” as when we say that the height of a human is 69 inches, mean¬ 
ing that our judgment is that the height is somewhere between 68.5 and 69.5 inches. 
Such grouping error of measurement is obviously inevitable; the extent of such error 
can be diminished by using an improved measurement process but it cannot be elimi¬ 
nated totally. We suggest that this is entirely obvious. If the point is accepted then the 
implications with respect to the use of continuous probability (or relative frequency) 
models are clear. Our use of a mathematical distribution such as the Gaussian distribu¬ 
tion to represent real observations of a numerical magnitude is an approximation that is 
convenient for many purposes but misleading for some purposes. Without discussion, 
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we give our view that one never observes exactly a random variable that is Gaussian, 
and furthermore, that we, finite individuals with finite observational abilities, do not 
observe events of probability zero, as a naive reading of some statistical theory would 
suggest. The point here is discussed by Kempthorne and Folks (1971, pp. 258-263) in 
connection with likelihood ideas. 

A second property is that the measurement process itself does not affect the prop¬ 
erties being measured, so that the achieved measurement can be regarded solely as a 
property of the object being measured. In all the common cases of physical measure¬ 
ment this is a property that is assumed, rationally, to be met. In the case of measurement 
of a mental attribute, it is a property that may be questioned. Certainly, in a psycholog¬ 
ical or psychiatric interview it cannot be assumed. At the level of measuring properties 
of elementary particles in physics, there is a fundamental indeterminacy in measuring 
two attributes; position and momentum, that is formalized in the Heisenberg (1901- 
1976) uncertainty principle. 

1.4 VARIETIES OF THEORY 

It is essential to distinguish several types or varieties of theory. Rather than attempt to 
characterize these by terms, such as weak or strong, which always carry pejorative and 
derogatory connotations, we shall try to give an idea of what we have in mind. 


1.4.1 Two Types of Theory 

There are, it seems, two basic types of theory. One type is exemplified by the theo¬ 
ries of classical physics. These are dominated by modeling a system of one or more 

particles through time. One observes attributes, say, a } b ...at times _One 

looks at the resultant data and one surmises that the variables a. 6 ,..are functions 
of time a{t)s b(t), _One can then conceptualize that their observations are realiza¬ 

tions of functions of time, which we can denote by A(T), B(T ),..such that these 
are general relations holding over time, T, which are subject to various mathematical 
relations, usually involving derivatives and partial derivatives. One then has a formal 
mathematical problem in the conceptual mathematical variables, which one can solve. 
Having then obtained an understanding of the mathematical functions A(T), B(T), 
and so on, one then translates this into functions a(t), b(t), and so on, that are to give 
predictions of what one will observe with the observable variables. Proof of the validity 
or rather justification, because there can be no proof, of the process is given by obser¬ 
vation, obtaining empirical relationships by data analysis, “translating” these into re¬ 
lationships among the conceptual mathematical variables, deducing the consequences 
in the mathematical formulation, and checking that these consequences are verified as 
predictions in the observable world. A general problem underlies the whole of this pro¬ 
cess, the problem of epistemic correlations; on the one hand, we have the observable 
real world, with observations given by observation protocols; on the other hand, one 
has a mathematical theory with mathematical variables; one wishes to use deduction in 
the mathematical system with formulae for and relationships among the mathematical 
variables to infer formulae for and relationships among the real-world variables. One 
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is using what are called epistemic correlations between observable variables of the real 
world with mathematical variables of the mathematical formalization. This process is 
so widely used in basic mathematical physics that one uses the same symbols for the 
real-world variables and the mathematical variables. In other areas of science, one finds 
mathematical variables used in a mathematical formalization that do not correspond to 
any real-world variables that can be observed. Under such circumstances, the nature, 
role, and utility of theory must, surely, be questioned severely. 

There is no disagreement that the so-called theories of physics are really and truly 
theories. We see what may be termed the full mix: observation, data analysis, concep¬ 
tualization to a mathematically exact theory, developing this theory to mathematical 
consequences of the elements of the theory, and finally verification by reference to the 
real observable world. 

To exhibit the contrast that we wish to emphasize, we ask the reader to consider 
some examples: the Aristotelian theory of tragedy, sociological theory, psychological 
theory, and the theory of plant and animal nutrition. In all these cases, we hold the 
view that the designation as “theory” is valid. It would seem that it should be quite 
unnecessary to make this statement, and we would not feel called on to make it if 
we do not see clear evidence that some individuals educated in the so-called “exact 
sciences” dismiss what some groups, e.g., sociologists or psychologists, describe as 
their theories，as being not theories at all but strings of highly imprecise verbal, that is, 
nonmathematical, “literary” expositions that cannot be given the status of theory. 

1.4.2 What Is a Theory? 


To form judgment of the question of whether an account of an area of human interest 
should be accorded the status of theory, it is helpful, we think, to first look at physics. 
Obviously, we cannot review the progression of theory in any direction, but it is useful, 
indeed critical, to glance over physical theory. We are told by Russell (1959), that phi¬ 
losophy and science began with Thales of Miletus (624-547 B.C.)), who is reported to 
have said, “All things are made of water,” a theory, even though entirely verbal. Anaxi¬ 
mander (610- ca. 546 B.C.)) questioned this, ‘‘Why choose water?” He said, it appears, 
that man derives from the fish of the sea — again a theory. For Anaximenes (ca. 570 — ca. 
500 B.C.)), the basic matter was air. Later for Pythagoras (ca. 569-ca. 475 B.C.)), the 
whole of reality could be captured by numbers and mathematics. For Heraclitus (ca. 
600 B.C.)) the real world consisted of a balanced adjustment of opposing tendencies, 
then he chose Fire as the primary ingredient. These are mere examples from the suc¬ 
cession of theories that were held at one time or another. Somewhat later Leucippus 
(480-420 B.C.)) put forward the theory that the world is made up of “rigid, solid，and 
indivisible” atoms. This theory was developed by Democritus (460-370 B.C.)). Neces¬ 
sarily, we do not enumerate the Socrates — Plato story, which used the theory of ideas, 
except to give our impression that this was both sterile and highly captivating to the 
point that it has influenced science strongly over the millennia. Also, rather clearly, 
its main thrust was towards ethics and the nature of man. Books on the nature of the 
thoughts of Socrates (469-399 B.C.)) and Plato (427-347 B.C.)) would easily fill a small 
library. The scientific ideation of Plato was that everything could be reduced to geom¬ 
etry, which much later was reduced to algebra by Descartes. Next came Aristotle who. 
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we suggest, was perhaps the first real scientist. He worked on classification of animals 
and did research in marine biology. The assessment of Aristotle, by the later world, 
and particularly in comparison to Plato is very mixed — on the one hand, just a pale 
imitation of Plato and on the other hand the first scientist and philosopher of science, 
as well as having made vast original contributions to human knowledge. After a dor¬ 
mancy of centuries, apart from some Moslem thinkers, such as Avicenna (980-1037), 
in opposition to the writings of Aquinas (1225-1274), Roger Bacon (1214-1294) can 
reasonably be regarded as initiating modern science with the thesis that we must re¬ 
sort to experiment. Bacon was condemned by the Pope and spent 12 years in prison. 
Again summarizing a huge history, and perhaps rather unreasonably, we see the helio¬ 
centric theory of Copernicus (1473-1543), the works of observation of Tycho Brahe 
(1546—1601)，and the data analysis of Kepler (1571-1630), events which must surely 
be regarded as early and critical in the development of physical science. During the 
same period Francis Bacon (1560-1626) produced his “Novum Organum.” According 
to Russell (1959) “to replace the evidently bankrupt theory of the syllogism,” Bacon 
put forward the method of induction. Again “cutting through” a long and fascinating 
history and talking only about science, which was physical science, we see the the¬ 
ories of Boyle (1627-1691), Lavoisier (1743-1794), Faraday (1791-1867), Maxwell 
(1831-1879), and so on, just to mention a few of the significant names. The point 
of the present discussion is to indicate the succession of theories — theories that are 
mathematically based. A curiosity of the present time is that the Einstein (1879-1955) 
axiom that nothing can exceed in velocity the speed of light is now being questioned, 
and it seems seriously. So, we see, no axiom of a theory of the real world, no basic 
proposition about the real world, survives the so-called extreme doubting of Descartes. 
There is, at base, no single generalization about the real world that should be taken as 
undoubtable. The life of undoubted generalizations of the past has decreased over the 
centuries, and much more rapidly so in the twentieth century with relatively huge and 
growing scientific efforts of mankind. 

It is our view, then, that there are varieties of theory. There are systems of the real 
world that can be idealized into very simple ones, with the aid of ideas such as mass 
and force. Furthermore, these systems can be isolated from the rest of the world, as 
in the physics laboratory at the elementary college level and even at a more advanced 
level such as the now easy experiment of weighting an electron. The same happens 
in chemistry as is too obvious to need discussion. But the actual world is so much 
more complicated that to try to place biology, medicine, psychology, and so on in the 
so-called exact physical science mold is little short of ludicrous. 

This rather forthright statement needs, perhaps, substantiation. So we give some 
obvious examples. Consider plant growth, for instance. We have no problem in being 
reductionist, that is reducing, in our minds, a plant or a tree, say, to a physical system. 
A standard college text on plants tells us about the system, roots, stems, leaves, flow¬ 
ers, and so on; we can see, to some extent ，the vascular system; we can, in some cases, 
feed the plant radioactively tagged chemicals, via the soil in which it grows, and we 
can follow this material as it progresses through the plant. We know a huge amount 
about plants, but, also, there is a huge amount we do not know; for instance we may 
ask what “really” goes on in mitochondria, what do the Golgi bodies do? These are 
merely two examples. Then we know that the growth of plants depends on many types 
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of nutrients, nitrogen, phosphate, potash, and so-called minor elements; on the amount, 
nature, and timing of water supply to the plant; on the climate the plant experiences, 
and so on. Then we have to adjoin the demonstrated experiential/acf that genetics is 
important. Perhaps, in the not too distant future, we shall have a technique by which we 
can determine the whole DNA sequence in the chromosomes of a plant. This will be 
represented by a string of well-defined symbols, doublets from C, G, A, T. The reduc¬ 
tionist hypothesis is, essentially, the hypothesis that if we know everything described 
above and a huge variety of other aspects not mentioned, then we could explain why 
one ash tree sheds its leaves three weeks earlier than another ash tree which grows 
some 70 feet apart. This is plainly silly. We shall never have enough data to establish 
the types of law that we see in physical science. Even if we had all the data on plants 
that a group of well-trained biologists regard as relevant, we shall be in the position of 
trying to model, for simplicity, one attribute such as plant height at maturity in terms 
of thousands, even millions, of potential explanatory factors. 

Now let us take another example — humans. The complexity of the biophysical 
system in humans is really quite fantastic. Surely, there is no need to exposit why a pure 
reductionist attitude and approach cannot be generally followed. We can be reductionist 
about certain phenomena, such as certain genetic diseases, and many other medical 
workers could enumerate. But it is plainly silly or ludicrous to attempt to formulate 
a system of differential equations, say, to explain human growth, these equations, of 
course, involving all or even a small fraction of the factors that we know to be involved. 

Obviously, the same sort of discussion can be applied to psychology, sociology, 
and wildlife studies, to mention just three areas of science. 

A consequence of this argument is that in many areas of science the modeling 
can only be simple, and, often, not even mathematical. As we have said, to use this 
experiential fact to dismiss many areas of science as not being “real science” is stupid 
and myopic as well as arrogant. An example is given by Linus Pauling (1901-1994), 
winner of two Nobel Prizes, one in science and one in peace. The one in science was 
surely one for reductionist science in an area that could be reduced. In his later years, 
Pauling exposited over the nation his theory that massive dosing of vitamin C will 
prevent the common cold (Pauling, 1970). Additionally, Pauling has a theory, a verbal 
one, as to why this should happen, this theory relating to the ascent of man in tropical 
environments (Pauling, 1970). Our point is that Pauling has a theory. One may not like 
it. One may question it. It is a falsifiable theory, obviously, by means of comparative 
experimentation. 


1.5 THE PROBLEM OF GENERAL SCIENCE 

In the pure physical sciences, one can isolate “small” systems from the rest of the 
world (perhaps at the cost of vast concrete enclosures). Any such small system can be 
manufactured independently by many scientists. The proof of validity is that different 
scientists following the same protocol of investigation obtain equivalent results (apart, 
perhaps, from measurement error). 
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1.5,1 Two Problems 

When we turn to general science, in contrast, a first problem is that we cannot manu¬ 
facture essentially or nearly identical small systems. In human biology, we cannot find 
two humans who are essentially identical. Even in the case of so-called identical twins, 
it is the case that the two members will not have experienced the same environment 
at the same moment of life. What then are we to do to attempt to falsify Pauling’s 
theory, for example. In agriculture, we cannot find two plots of land that are identical. 
Two plots may look identical to the man in the street, but one can make many physi¬ 
cal, biological, and microbiological measurements, to show that the two plots are not 
identical. Consider, again, a rather advanced and frequently used, surgical procedure, 
the coronary bypass operation. Can one find two identical humans so that we can have 
a simple comparative experiment, with one being a control and the other receiving the 
operation? Obviously, we cannot. Can we model, mathematically, the heart system so 
that we have a theory to which we can apply the tight deductive approaches of math¬ 
ematics? Again, obviously not. What then are we to do? Chapters that follow on 
randomization give one suggested process. 

A second problem is that we wish to draw conclusions about a population of 
units, for instance, humans who at present or in the future will have the problems for 
which coronary bypass surgery is a possible treatment. The standard way (except to 
Bayesians) to approach examination of a large defined population of units, for instance, 
humans, is to use the ideas of random sampling — that is, draw a sample at random from 
the population, examine the sample and attempt to make some sort of inference about 
the population. But this prescription cannot be applied to the populations of the future 
for which we wish to “make an inference.” We do not know the set of humans who 
will be candidates for bypass surgery in the future. How then are we to attempt to form 
judgments? 


1.5.2 The Role of Data Analysis 

This problem is, obviously, of vast importance. Unfortunately, it does not seem to be 
generally recognized to be one in common statistical circles. So we give a little discus¬ 
sion. Given a set of human subjects in 2005, we can perform a comparative experiment. 
Having performed the comparative experiment, we have to attempt to determine if the 
response to treatment is in the same direction for all groupings of subjects that we can 
envisage; i.e., is this so for males and females, for nonsmokers and smokers, for blon¬ 
des and brunettes, for thin and fat people, and so on. This is obviously an impossible 
prescription to fill. All we can do is to do data analysis in which we look to see if 
such factors of classification (categorical, ordered categorical, or arithmetically based) 
give evidence of having explanatory power with respect to outcome of the experiment. 
We shall discuss this in the later text under the names of “additivity” and concomitant 
variable analysis. However, we inform the reader that there are no simple answers. The 
discipline of statistics suggests data analysis procedures. 
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1.5.3 The Problem of Inference 


The outcome of such analysis may be, for instance, that thin people do not respond well 
and that fat people do respond well. Or that individuals of blood group O respond well 
and individuals of other groups do not. We are merely giving examples. Obviously, in 
this type of activity “one swallow does not make a spring,” or, more explicitly, one such 
study is mere evidence, perhaps strongly suggestive. So studies have to be repeated 
under different naturally occurring circumstances, with different groups of people, for 
example, South European, North European, African, Oriental, and so on. The outcome, 
one hopes, is very much the same in all these groups. The wider the groups of people 
experimented with, the more confidence one will have, in an unquantifiable way，to 
“extending” a resultant inference to groups not represented in the studies and the more 
confidence one will have in extending the inference to John Smith in August, 2010, who 
is 57, white, blood group 0，" .. It is clear that the extension of an inference from data to 
this John Smith is not one that can be made tight. The inference is subjective. It will be 
made by the controlling physician, and you, the patient, can do nothing except to hope 
that the physician makes good judgments. One will be able, perhaps, to see data that 
enables one to quantify formally the judgment ability of the physician; as, for instance, 
it can be found that he had met 25 cases “like” the one under consideration, made a 
decision, and then found that he was correct, in some sense, in 24 of the cases. One 
could formalize this problem somewhat. One could say to the physician, “You, surely, 
understand coin tossing, so that you understand what a probability of jq is; simply 
the probability of getting four heads on four successive tosses of a (tested) penny. So, 
now please give me your judgment of your probability that this proposed operation 
will benefit me.” We can be quite sure that practicing physicians follow some route of 
this sort. Presumably and hopefully, this judgment will be based on literature search 
and on actual experiential facts. Also, however, it is necessarily based on incomplete 
analogy, expressed by the physician somewhat as follows. “You，John Smith, are a 
unique individual. No one else has your genetic structure: no one has had your life 
experiences; no one is in exactly the same configuration as you; but my judgment is 
that you are sufficiently like the humans in such and such studies, that I feel justified in 
applying, say，the 95% chance of success observed therein to you.” 


1.6 CAUSALITY 


It is obviously critical that a general discussion, even if brief and even potentially par¬ 
tially misleading, be given of the idea of causality. This is a topic with history going 
back to the pre-Socratics; also, we are confident，that it occurs in early non-Greek writ¬ 
ings; and it surely occurs in ancient philosophy of the East. We, not only in science, 
but in nearly all human mental activities, use some concept of causation. As the ensu¬ 
ing discussion indicates the word “cause” has been used for millennia and at present 
in several senses that are not at all consonant with each other. Because the design of 
experiments is directly aimed at one type of causality, it is essential to try to achieve 
a coherent and useful understanding. Underlying every use of the word “cause” is a 
primeval concept that everything that happens had a cause (cf. the Kantian a priori 
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synthetic truth). 

To begin, let us list some examples, “higgledy-piggledy” of statements using the 
words “cause ， caused, and because of.” Our reason for writing these in haphazard order 
is a simple one, namely, to exhibit the fact that the concept of cause is used with a 
recklessness that is really quite appalling. We shall not always complete the statements 
and shall give ... to indicate that a proverbial intelligent and educated person, could 
complete the sentence in various ways. 

The sky is blue because … 

A rainbow is caused by … 

Such and such happens because of the second law of thermodynamics. 

The apple fell because of gravitation. 

Genes cause IQ. 

Radiation causes cancer. 

Socioeconomic status is one of the causal variables of crimes. 

The tight money supply causes stagflation. 

Bill Jones caused the automobile accident. 

The hooter in the factory at Birmingham caused the workers to stop for lunch. 

We have day and night because the earth is rotating. 

1.6.1 Defining Cause ， Causation，and Causality 

We now attempt to encapsulate the ideas of Aristotle on cause and causation. We use 
Runes (1962) as part of our information sources. First on the nature of cause, Aristotle 
distinguished four interpretations: (1) the material cause out of which something arises; 
(2) the formal cause, the essence determining the creation of a thing; (3) the efficient 
cause, a force or agent producing an effect; and (4) the final cause or purpose. This 
language is surely perplexing. Why should one equate cause and purpose? Obviously, 
one becomes enmeshed in teleology, that everything is in the world for some end, 
some “telos.” Not unsurprisingly, both the idea that there must be a prime, necessarily 
single, cause and the idea that life has a “telos，” led Aquinas to one of his proofs for the 
existence of God. a proof rejected by many outstanding philosophers since, especially 
Kant. Newton (1643-1727) was a great believer in a concept of causation, that to every 
effect we must assign a cause, though just what Newton really meant by this obviously 
innocuous proposition should, we suggest, be considered moot and uncertain. 

When we turn to causality, we find ourselves drawn into an even deeper quagmire 
of words and statements. Causality is the relationship between cause and effect. Given 
the obvious obscurity about cause and also of effect, it is hard to make progress. We 
have no doubt, in these days, about accepting the idea that radiation, a “cause,” pro¬ 
duces cancer, an effect. M. T. Keeton in the aforementioned Dictionary of Philosophy 
(Runes, 1962) lists nine definitions for causality; they are so important that we have no 
alternative but to attempt to encapsulate these in very few words: 

1. a relation between events, processes or entities in the same time series subject to 
several conditions; 

2. a relationship between events, processes or entities in a time series such that 
when one occurs the other follows invariably; 
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3. a relationship, etc., such that one has the efficacy to produce or alter another; 

4. a relationship, etc., such that without one, the other could not occur; 

5. a relationship between experienced events, processes or entities and extra-experiential 
events, processes or entities; 

6. a relation between a thing and itself (self-causality); 

7. a relation between an event, process or entity, and the reason or explanation for it; 

8 . a relation between an idea and an experience; 

9. a principle or category incorporating into experience one of the previous. 


If the reader is perplexed with all this, we have great sympathy. 

A shorter classification is given by Nowell-Smith (I960). He distinguishes three 
senses which we attempt to summarize: 

I. Human agency — to cause an event is to perform an action which produces a 
prechosen outcome; 

II. Causes in Nature_to characterize a natural event that produces a certain precho¬ 
sen outcome; 

III. Cause as explanation. 

This third sense begs the question: “What is explanation?” of course, on which 
many are curiously silent. This seems to be a conceptualization like I and II without an 
active agent or a natural agent. It can always be used in answering the question “Why?’ 
It may be a state of affairs, as in the proposition “Height at age 6 causes height at age 
12.” The reader may find it useful to place the usages given earlier into categories (1) 
to (9), or in categories I, II, and III. In spite of the huge use of sense III, the usage is 
murky. Mill (1806-1873) thought selection of one factor as cause from the whole set 
of antecedents was arbitrary. Nowell — Smith says: “… alternative explanations do not 
exclude one another; any number of them can be true, and the cause will be relative to 
the interests and abilities of the investigator.” This, we suggest, “lets the cat out of the 
bag.” The use of cause in sense III is remarkably vague, fuzzy, and indeterminate. 

It is useful, we think, to recall a famous example of Bertrand Russell, the hooters. 
One is an observer looking at a factory in Birmingham, England; one notices that when 
a hooter is sounded, the workers stop work for lunch. The event A — “workers stop 
for lunch” invariably follows the event B — “the hooter sounds at the factory around 
midday.” We have invariable succession. Therefore, event B causes event A. Who 
can question this? It is surely obvious. But there is, we suggest, a hole. Suppose 
you are an observer in Glasgow, some 200 miles away. Factory hooters are used there 
also. Also, you have a screen which you have excellent reason to trust as conveying 
what is happening at the factory in Birmingham. However, you do not hear sounds 
at Birmingham. What will you see? The event A*, “the hooter sounds in Glasgow” 
is invariably followed by the event B, the workers stop for lunch in Birmingham — the 
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same B as before. Hence it is obvious to you that A* causes B. Surely! But this is 
plainly ludicrous. Suppose we really wished to check the proposition that A* causes 
B. To do so is easy; most children would, we surmise, tell one what to do: Simply, 
just arrange for the hooter in Glasgow not to work, e.g.，cut off its power or, whatever. 
Then see if B occurs. What will happen, of course, is that the observer will note that 
B does occur. The point of this rather silly example is merely to indicate that the idea 
of inferring causation from invariable succession is a rather hopelessly inadequate and 
improper act. This problem is, of course, rather simple to look at. Merely stop the 
hooter in Birmingham and see what happens. 


1.6.2 The Role of Comparative Experiments 

We do not wish to be derogatory of the various uses of cause and causation. Obviously, 
humanity has found the various usages useful. The use of cause as explanation as in 
explaining what happens in a physical process by means of a law — even a conceptual 
theoretical one rather than a purely inductive generalization — has been fantastically 
useful. The absence of role of theory has been the perennial and valid criticism of 
Bacon’s “Novum Organum.” 

This having been said, however, we take the view that cause in sense I, that of 
human agency, is the critical one for very many, perhaps most, of the concerns of 
humanity. If we were to radiate humans, would we later find cancer in them? “How can 
I make the grass on my lawn grow?” is a perennial question of suburban America (and 
elsewhere, perhaps even in Russia). Our horticulturists have a partial answer: Put on 
nitrogen. We have experimented, we have done comparative experiments and we found 
an invariable succession. Event or action, A: “Put nitrogen on a lawn” is followed 
invariably by event B: “The grass grows.” Outside the sphere of purely theoretical 
science, the idea underlying this “inference” permeates real world science. The big 
philosophical movement that underlies this approach is “Pragmatism,” formulated by 
the leading United States epistemologist of all time, C. S. Peirce. It is summarized in 
the oft-quoted statement of Peirce (1963, p.6): 

In order to ascertain the meaning of an intellectual conception one should consider 
what practical consequences might conceivably result by necessity from the truth 
of that conception, and the sum of these consequences will constitute the entire 
meaning of the conception. 


We do not claim, really, to understand exactly what Peirce was saying — we merely 
have glimmerings. (What is the meaning of “by necessity,” a phrase so often used in 
epistemological writings.) We do, however, interpret this and other writings, especially 
those of John Dewey (1859-1952), to conclude that a necessary and even critical pro¬ 
cess in all science, whether “pure” or “applied” (an unfortunate but commonly used 
dichotomy) is the process of comparative experiments. Does vitamin C prevent colds 
or cause absence of colds? The only way to form a good judgment on this is the con¬ 
trolled experiment in which some individuals receive massive doses of vitamin C and 
others do not. Then compare the outcomes. It would be plain silly and page-filling 
to make a list of questions and problems that are attacked by the comparative experi¬ 
ment method. It is hard, even, to think of a human problem of the biophysical nature 
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and even psychologic, which cannot be attacked by the comparative experiment. We 
close with just one example. We are all concerned about disease; we all want allevia¬ 
tion of some pain or discomfort at one time or another time. So we have investigators 
studying methods of intervention — take this pill, have that operation and so on. The 
last requirement for a submission of a request for licensing a drug use is the clinical 
trial — a comparative experiment. Without such an experiment that is judged adequate ， 
an application is simply not even considered. The immediate foregoing constitute the 
case for the importance of comparative experiments. 

We must now cover a general point. The simple comparative experiment uses ex¬ 
perimental units, mice, humans，pieces of steel, etc., and with a proposed treatment, T, 
includes also a control, C, say, which is nothing but absence of T. One then compares 
the two groups of outcomes, those following T and those following C. In most so-called 
inexact sciences it is critical that a study possess “adequate controls,” to use a hack¬ 
neyed but useful expression. It appears necessary to state an “obviosity，” something 
that is obvious. The behavior under the control may have been established with indef¬ 
initely strong validation by previous investigation. If we want to investigate the effect 
of applied heat to, say, a beaker of sulfuric acid, to take an absurdly simple example, 
one does not have to do the full comparative experiment of having, say, 6 beakers of 
acid which merely sit on the bench and 6 beakers to which heat is applied. One knows 
what will happen to the controls. If we are studying cancer of the colon and contem¬ 
plate a surgical procedure we do not have to obtain, say, 12 patients and then merely 
maintain 6, while treating 6 by the surgical procedure. We know, empirically what will 
happen to the controls. In the case of coronary bypass surgery, on the other hand, we do 
not know how the controls will react, so that we have a huge comparative experiment 
known as the Coronary Artery Surgery Study (CASS) sponsored by the National Heart, 
Lung and Blood Institute (see e.g. CASS Principal Investigators, 1983a, b; Rogers et 
al., 1990; van Belle et al., 2004 (Chapter 20)). We could trace down perhaps 100 such 
large comparative trials in human medicine, and we could track down thousands in 
health research organizations, including so-called “drug houses.” We could track down 
many thousands of comparative experiments in agriculture and biology over the world. 
And so, on and on. The subject of “Design and Analysis of Experiments” needs no 
justification by philosophers of science or mathematical statisticians. 

Finally, there is a critical point that must be discussed, at least, in a preliminary way. 
A comparative experiment consists of treating experimental units according to various 
protocols of experimentation, with necessarily one unit receiving only one protocol. 
Having done the experiment, what can one conclude? Clearly, nothing more at best 
than that protocol A led, say, to recovery from such and such a disease; or that the total 
act of putting 40 pounds of nitrogen per acre on the lawn led to a fine lawn. Having 
found such a conclusion, it is both necessary and inevitable that one should ask: What 
in the protocol produced the effect? Recall the hoary, but informative, example that 
goes as follows: When I drink vodka and tonic, I get drunk; when I drink a scotch and 
water, I get drunk; when I drink gin and tonic water, I get drunk. What then is the cause 
of my getting drunk? I ponder the question and come to the conclusion: the only thing 
common to those interventions that make me drunk is that each intervention includes 
my drinking water. The example is laughable — but we must interpose 一 to us with our 
knowledge. It is, of course, easily questioned by asking: Does drinking water make me 
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drunk? Obviously, here, we have a historical control with respect to the hypothesis and 
then, obviously, the causal inference is merely silly. 

The example illustrates, however, a remarkably critical aspect. The isolation from 
a protocol of one particular component as being the “real” causative agent is in some 
cases a very simple matter, as in our hoary example above. But in general this can be 
very difficult. The health research field is interesting, and perhaps, exemplar. It is not 
enough to know that drug X cures an ulcer (supposing that it does). It is regarded as 
essential to have evidence and hypotheses that are consonant with accepted scientific 
knowledge of the mode of action of the drug; in what way, does it produce its effect? 
This brings in, of course, the whole field of pharmacology. This type of epistemolog¬ 
ical problem pervades science. It is an attempt to justify the efficacy by reference to 
established scientific laws. 

This final point has a rather curious aspect that arises in psychological and soci¬ 
ological experiments. The mere fact of intervention, independently of the nature of 
the intervention, may produce an effect. This is strongly reminiscent of the possibility 
that the act of observation alone may produce an effect. The only way of attacking 
or falsifying such an explanation is, again, by comparative experiment appropriately 
designed. 


1.7 THE UPSHOT 

We have given a very long discussion of basic ideas. We do not apologize for the 
length. We have tried, foolishly perhaps, to capture or encapsulate in a few short pages 
the whole of the intellectual efforts of Mankind to “come to peace with” the unending 
anxiety of Mankind to understand and control the processes, haphazard though they 
may seem to be and must have seemed to be to, say, the educated Greek of 1500 
B.C. The outcome of this effort is, we suggest, to convince the reader that the role of 
experiments and in particular that of comparative experiments and of interventional 
studies are critical in the grand effort. If this is accepted, then there is a clear need for 
a book on “Design and Analysis of Experiments.” 


1.8 WHAT IS AN EXPERIMENT? 

To focus briefly, and on a less philosophical basis, on the nature of this book, it is 
appropriate to ask and discuss this question. 

An experiment is “deliberate observation under conditions deliberately arranged by 
the observer” (Stebbing, 1961, p. 302). This statement is acceptable as a beginning, but 
it is surely not adequate for general use, because there are many sorts of experiment, 
just as, for instance, there are many sorts of mammal. It is important to make some sort 
of classification. If, for instance, John Doe goes to New York to look at Rockefeller 
Center, this action could be called an experiment, according to the quasi-definition 
above. 
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1.8.1 Absolute and Comparative Experiments 

One, by now fairly ancient，partition was that into absolute experiments and compara¬ 
tive experiments. Unfortunately, the ideation behind this dichotomy is not at all clear. 
The determination of the weight of an object, for example, a human, or a sack of sugar 
is based on the idea that the object has a definite fixed attribute, the weight, which is the 
result of applying a measuring apparatus to it. The determination of the speed of light, 
assumed to be a constant attribute of light is obtained by a particular process of mea¬ 
surement. The process of measurement may well be based on theory, which consists 
of a mathematical structure incorporating mathematical variables that are considered 
to represent actual real world properties. In any of the examples of this paragraph, the 
hope is that if the whole process is repeated, one will obtain the same result. If the result 
is a categorical one such as a color, this may happen. If, however, the result is an arith¬ 
metic number, like weight in kg, grams, milligrams, …， it is entirely unusual for this 
to happen. One need only experience the taking of one’s weight on a well-graduated 
balance, e.g.，with gradation of - or | of an ounce. Repetitions of the measurement 
process will not yield the same number. The assumption that is made, rather uniformly, 
is that the numbers that are obtained are independent “realizations” of a random vari¬ 
able, which, furthermore, is in fact a standard Gaussian random variable, that is, with 
zero mean. We place the word realizations in quotes because an actual realization of 
such a random variable will be an infinite decimal. We say that this assumption has 
been made by essentially everyone in this area. A few workers have suggested that the 
appropriate distribution is the double exponential with zero mean. 

The idea of repetitions, along with the idea of replication which permeates this area 
of design and analysis of experiments, is very difficult to characterize. Suppose John 
Doe makes a measurement at 9:00 a.m. Then has a cup of coffee, and then repeats the 
measurement at 10:00 a.m. Is this repetition a replication? John Doe at 10: a.m. is 
different from John Doe at 9:00 a.m. Also, of course, what is being measured may be 
different over the two times. Curiously, there is very little discussion of the semantic 
problem that underlies the ideas. Repetition of an observation requires constancy of 
what is being observed. If what is being observed is not constant, then repetition does 
not have constant significance. It is clear that a basic component of education in the 
physical sciences is training in observation so that different observers will obtain “the 
same result.” This does not happen, of course, with any measurement problem of 
an (assumed) underlying continuous variable. If we assume, for instance, as seems 
reasonable, that x equals the weight of John Doe at 10:00 a.m. can be any real number, 
then x cannot be observed. The simplest model of actual observation is that the real 
line is partitioned by a grid, and actual observation consists of deciding that the value 
sought lies in a particular cell of the grid. This will not be totally satisfying because 
the observer will meet cases such as the grid being in intervals of. 1 and the observer 
will meet observations which appear to him to be at a good point. If this happens at 
all frequently and if, say, the difference between an observation being in [5, 5.01] and 
being in [5.01, 5.02] is important, then a finer grid must be used. 

It is necessary, obviously, that measurements made by scientists agree — that mea¬ 
surements have interpersonal validity. It is not at all obvious that this will happen with 
a measurement process. So it is necessary that a study be made of the process, by 
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repeated measurements by the same and different measures. This involves the con¬ 
struction of a design and protocol of such a study. 

1.8.2 Three Types of Experiments 

We distinguish basically among the following three types of experiments. 

Type I: The observation of an assumed constant. Examples are the measurement of 

(i) the velocity of light; 

(ii) the mass of an electron; 

(iii) the gravitational constant; 

(iv) the conductivity of a sample of water. 


Any book on chemistry describes a multitude of fixed properties of chemical sub¬ 
stances. If a measurement gives variable results in which the variability is greater than 
that explainable by pure measurement variability, the natural assumption is that the 
material being measured is not constant. 

Type II: The measurement of a property of a population the numbers of which have 
variability. Obvious examples are 

(i) the average income of the population of families in the USA; 

(ii) the average age of automobiles that can use the roads of the USA; 

(iii) the average of the number of years of education of the adults of the USA; 

(iv) the area in the USA that has been planted to com in the year 2000. Any book on 
economics and sociology mentions many such properties. 

In Type I, there is strong evidence that there is an underlying constant and the only 
problem is that there may be, or more generally, will be measurement errors. In Type 

II, there is the assumption, usually somewhat well based ，that there is an underlying 
constant for each member of the population. 

The present book is concerned with a very different situation, which we call Type 

III. It is best exemplified by biological examples, but the same considerations arise 
throughout all technology, including engineering, and agriculture. Suppose we wish to 
develop a diet to promote growth in children of, say, age two years. We know from 
utterly casual observation that children grow at various rates. Suppose we quantify 
growth by measuring height at two years and at three years of age. We know that 
we can measure height easily within \ inch. We know that growth is a very variable 
process. Some children grow very little from age two to three. Others grow a lot. The 
variable we are trying to understand and modify is height gain from age two to three. 
We do not know what diet to use but we have ideas. The only thing we can do is to run 
an experiment comparing the diets that we judge to merit consideration. 

This situation is entirely different from those of Type I and II above. In both of these 
types there is a true value with the possibility, or, in fact, certainty of measurement or 
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observation error. In Type III it is impossible to have total replication of any diet. 
To obtain this, we would have to possess two or more children of age two, who are 
totally alike and who will be exposed to the same environment from age two to age 
three. The only thing we can do is to apply each diet to several two year old children 
and observe them from age two to age three. In doing this，we have replication over 
children variability. We can obtain measurement variability by repeated measurement 
using a measurement process that is under statistical control. This Type III experiment 
is commonly called a comparative experiment because the experimenter is comparing 
what are naturally called treatments. It is obvious that comparative experiments have 
been used and, indeed, should be used in all endeavors of critical inquiry, be this in 
science, and in this we include all sciences, in industry, especially in the manufacturing 
process, or in government, for developing certain types of social policies (see Northrop, 
1948; Scheffler, 1967). 

1.9 STATISTICAL INFERENCE 

As discussed above and as will become clearer in subsequent chapters, we are con¬ 
cerned about the effects of interventions. The sole purpose of the performance and 
analysis of experiments is therefore the drawing of inferences about the effects of treat¬ 
ments. 

1.9.1 Drawing Inference 

We have to present our opinions on what meanings we attach to the term “inference” 
and to the phrase “drawing of inference,” In Webster's Dictionary (1948, p. 1273) in¬ 
ferences are classified as mediate (= drawn from more than one proposition or premise) 
or immediate (= drawn from a single premise). The making of an inference is the act 
of passing from one judgment to another, or from a belief or cognition to a judgment. 

The field of statistics has been concerned with drawing judgments from observa¬ 
tions. 

Let us give a few examples. You, an ordinarily educated citizen, were exposed 
to various plays, allegedly written by William Shakespeare. This exposure we call 
the observation. Then you hear that a question has been raised on whether the plays 
were written by said Shakespeare or by someone else. Your task is to pass from the 
observation to your judgment. 

A second example is that you are to form a judgment on whether the sun will “rise” 
tomorrow. 

A third example is that an observation consists of the result of n tosses of a two- 
headed coin, which is r heads and (n — r) tails. You are to make a judgment of the 
result of a (n + l)th toss. 

These three examples have plagued philosophers of science for centuries. They 
exhibit differences in content. In the first case, the simple judgment is yes or no, though 
clearly a judgment could be that the author was Bacon, or perhaps others of large extent. 

In the second and third cases, a classical answer was to assume that the event was 
a realization of a binomial trial, p. Then we were to assume that p is a random variable 
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uniformly distributed over the interval [0,1]. And then to assume that we have observed 
r successes (heads) in n outcomes. By writing down the joint probability, one can 
obtain the conditional probability of p, given the observation of r successes from n 
trials, the posterior distribution of p, as 

f P ost(p)dp (X p r (l _ p) n - r dp. 

The posterior expectation of p is then (r +1 )/(n + 2). This result, that the “probability” 
of a success after r successes in n trials is (r + l)/(n + 2) was known as the law of 
succession. This “result” received wide support from philosophers and observational 
scientists for decades, even centuries. The whole story is absurd. What was probabil¬ 
ity? Why should the “true” probability be “distributed” uniformly over the interval [0, 
1]? Where did the representation of the result r successes from a trial as a realization 
of the result of n independent Bernoulli trials come from? 


1.9.2 Notions of Probability 

Quite a different use of an idea of probability arose in connection with games of chance: 
for example, with dice tossing and the question of what is the probability that a toss 
of three coins will yield three heads. This question is, of course, totally unanswerable 
without assuming a probability structure, that is, a class of elementary events with asso¬ 
ciated probabilities (which were equal). These elementary probabilities were assumed 
to be frequencies of outcomes. The outcomes were then frequency probabilities in an 
indefinitely large number of repetitions. The probabilities will not be realized unless 
the elementary assumed probabilities will be realized in an indefinitely large number 
of repetitions. This mode of development became a very significant portion of math¬ 
ematics, particularly in the development of asymptotics. This theory has little bearing 
on inference except to make a judgment of where the probability model is reasonable. 

A very different formulation of probability was developed by J. M. Keynes (1883- 
1946). He considers our premises to be a set of propositions h, and our conclusion to be 
a set of propositions a. “If knowledge of ft justifies a rational belief in a of degree q, we 
say that there is a probability-relation of degree a between a and /i” (Keynes, 1921, p. 
4). So for Keynes, probability is a degree of rational belief. However, Keynes does not 
explain what he intends belief to be and not at all what is rational belief. He claims that 
probable beliefs are objective and logical. Keynes then (Chapter IV) discusses a rule 
by which equiprobability could be established, due to Bernoulli (1654-1705), which 
he names the Principle of Indifference, according to which if there were several alter¬ 
natives with no reason for predicating one rather than another, each of the alternatives 
should have an equal probability. This was discussed by very powerful mathemati¬ 
cians: Borel (1871-1956), Poincare (1854-1912), and Bertrand (1822-1900). After 
an unsuccessful attempt to give a forceful presentation of his Principle of Indifference, 
Keynes (1921 ， p. 92) says: “The theory of probability, outlined in previous chapters, 
has serious difficulties to overcome. There is.. .difficulty in measuring or comparing 
degrees of probability... r He turns to the frequency theory of probability and bases 
his ideas on those of Venn’s Logic of Chance (1962). 

Venn (1834-1923) uses as a fundamental concept a series. The variable attributes 
of a series occur in a certain definite proportion of the whole number of cases in the 
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series. The probability of an event is the proportion of the event in the series. The 
origin of the phrase “the frequency theory of probability” is obvious. However, Venn 
did not discuss how a series should be envisaged. In fact, a series appropriate to a 
situation can be obtained only by assumption or from a history judged to be relevant 
and from data analysis. 

It is clear that philosophy has not been able to give a well-founded logic of the use 
of ideas of probability. There have been three developers of ideas of belief calculus 
of the twentieth century. Harold Jeffreys (1891-1989) gave a set of axioms which in¬ 
cluded the idea of a prior distribution: let y = (yi ， " 2 , • • • ， Vn) be the data; assume that 
this is a realization of a random variable whose probability distribution p(y\9) depends 
on a vector of parameters 9\ then suppose 0 is a random variable with probability 
distribution p(9)\ then the joint distribution of y and 6 is p(y, 6) = p(0)p(y\6) which 
is also equal to p{0\y)p(y). Hence p(0\y) = p(y\0)p(0)/p(y). This is the posterior 
distribution of 0 given y. There are two problems: (1) how do we gcip(y\6)l and (2) 
how do we get p(0)l Jeffreys did not address the first. He attempted to obtain a p(6) 
by a logical argument, but failed (naturally). The Jeffreys development is an attempted 
completion of the Keynes development. 

A second development was by F. P. Ramsey (1903-1930), who held the view that 
probability had to be based on knowledge and could be scaled by reference to frequency 
obtained by independent tosses of a perfect coin. 

A third development was made by L. J. Savage (1917—1971)，who used the ideation 
of Ramsey, in his book, The Foundations of Statistics (1954), where he advocated 
that the prior should be obtained by “introspection.” This work has received great 
support and has led to the resurgence of “Bayesian inference,” which, incidentally, 
is a misnaming, because Bayes (1701-1761) obtained his prior by a supplementary 
experiment. 

There have been attempts to justify what are called noninformative prior distribu¬ 
tions. Also there have been attempts to relate the choice of prior to the nature of the 
likelihood functions, p{y\0). This function must be obtained from data analysis, so we 
are back to “square one.” 

1.9.3 Variability and Randomization 

The need for use of probability ideas arises from the fact that variability of outcome is 
omnipresent. Furthermore, this variability must be discovered by actual observation of 
the variability and cannot be discovered by “pure thought.” So, in the beginning must 
come the obtaining of data and data analysis. 

Because we are concerned with design and analysis of experiments we have to 
consider how we can “live with” variability. We cannot assume that our data are a 
realization of some convenient stochastic process，e.g., a Gaussian linear model. We 
shall use randomization and rely on randomization tests of significance and inversions 
thereof to obtain intervals of uncertainty about effects of treatments. Doing one full test 
and inversion requires massive computation. However, we find that the randomization 
distribution of the usual test statistics is closely approximated by the Gaussian linear 
model distribution of the same statistics. This will be discussed in Chapter 6. The in¬ 
versions of randomization tests of significance gives then statistical intervals of uncer- 
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tainty, commonly called (but erroneously) confidence intervals (see, e.g., Kempthorne 
and Folks, 1971). The point is simply that the probability is ensured essentially by the 
randomization procedure. 
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CHAPTER 2 


Principles of Experimental 
Design 


In subsequent chapters we shall describe in detail various experimental designs, their 
properties, construction, and analysis. We shall start with very simple designs and then 
proceed to more complex designs. Each design is based on a certain rationale (and 
we shall explain the basis for this) and is applicable in certain experimental situations. 
There are, however, some basic, common principles of experimentation and experi¬ 
mental designs that need to be clearly understood. These principles have to do with the 
formulation of the problem under investigation, the choice of the experimental design, 
the execution of the experiment, the analysis of the data, and the interpretation of the 
results. We shall discuss these principles in general terms in this chapter, leaving the 
more specific details for later chapters dealing with specific designs. 


2.1 CONFIRMATORY AND EXPLORATORY 
EXPERIMENTS 

Most experiments are of an exploratory nature in the following sense. The investigator 
is interested in finding out what factors have an influence on the outcome of a certain 
process. For example, one might be interested whether or to what extent the factors 
concentration of a chemical compound, time of baking, temperature of oven, degree of 
cooling, and amount of pressure have an effect, either individually and/or jointly, on 
the breakability of a certain type of cookware. The obvious procedure to follow here is 
to vary the “levels” of these factors and compare the performance of the various level 
combinations. How exactly this is to be done is, however, not so obvious. To perform 
the experiment many decisions have to be made, such as: the choice and number of lev¬ 
els of the various factors, possibly selecting only a subset of all feasible combinations; 
the choice of the experimental layout as determined partly by the physical conditions, 
partly by statistical considerations; the choice of measurement of the performance ； and 
the choice of the statistical analysis which is most appropriate for drawing conclusions 
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for the intended purpose. We shall address these types of questions in later chapters 
in great detail and discuss the underlying principles so that an investigator can make 
appropriate decisions for a particular problem at hand. 

Experimentation is essentially a sequential process. One experiment leads to an¬ 
other as some insight is gained from a process and new questions are being asked. An 
exploratory experiment, as described above, may be followed by what we may call a 
confirmatory experiment. We may, for example, want to compare the “best” procedure 
found from the exploratory experiment with an established procedure or product and 
“establish” that the procedure or product is “better” than the old. This in itself is already 
a well-defined and narrower problem than the one described earlier. As such, it calls 
for different design considerations. For example, the number of experimental runs may 
be very important so that the resulting statistical analysis, that is, the statistical test, 
may achieve a certain desirable power. 

To pursue the idea of a confirmatory experiment in a different direction, we may 
have found the “best” procedure and may want to establish, for process control pur¬ 
poses for example, its statistical properties. We know that process conditions may 
change and it is important, therefore, to establish the mean performance and the vari¬ 
ability associated with the process. For unsatisfactory values this may lead to refine¬ 
ments in the actual production process. 

The discussion up to this point has been deliberately vague. It is merely intended 
to give the reader some idea about kinds of experiments. We urge the reader to think 
about similar experiments in other fields of investigation and then carry them through 
the individual steps (to the extent possible) of experimentation which we shall outline 
in the following sections. 

2.2 STEPS OF DESIGNED INVESTIGATIONS 

In practical situations many scientific or industrial investigations are doomed to fail. 
There are many and varied reasons for this, but the most often encountered reason 
is simply that the investigation was not properly planned. Many investigators fail to 
understand that careful pre-planning is essential for a successful experiment. This in¬ 
volves going through a number of steps and making decisions at each point before the 
actual investigation begins. 

A schematic presentation depicting the logical steps of scientific and industrial ex¬ 
perimentation is given in Figure 2.1. 

In the following sections we shall comment on the individual steps and explain their 
importance in the overall process (for an alternative description of such an approach 
for industrial experiments see Coleman and Montgomery, 1993). These steps can be 
divided into two categories: statistical (that is, development of the statistical design, 
translation into a statistical model, and statistical analysis, which we shall refer to as 
the “statistical triangle,” indicated by solid lines in Figure 2.1) and nonstatistical in 
nature. Even though we shall concentrate in this book on the purely statistical aspects 
of experimentation, it is important to realize that the nonstatistical steps are intimately 
connected with the statistical steps and require interaction between the subject matter 
scientist/investigator and the statistician and should not be ignored in any discussion 



22. STEPS OF DESIGNED INVESTIGATIONS 


31 



ETC. 

Figure 2.1 Logical steps of scientific experimentation. 


of designing an experiment. 

2.2.1 Statement of the Problem 

Investigation starts often with a simple speculation: “A tree in my garden is hurting; 
I wonder if it needs more water?” Immediately this leads to questions, such as “How 
much water should the tree get and how often should it get water?” or “Are there 












32 


CHAPTER 2. PRINCIPLES OF EXPERIMENTAL DESIGN 


other deficiencies that need to be corrected and if so, how?” On a more scientific level, 
speculations and questions of this kind lead to the formulation of a problem: “I wish 
to determine the best cure for the tree in its present state” or, more in keeping with the 
topic of this book, “I wish to compare the effectiveness of alternative procedures for 
curing the tree.” 

Although this may sound obvious, each scientific investigation must begin with the 
development and a statement of a problem. This is important not only with respect to 
subsequent statistical considerations, but also with respect to delimiting the problem to 
one that can be addressed realistically. All too often experiments are started without 
a clearly formulated question, purpose or goal in mind, and all too often it is realized 
too late that such an experiment has been conceived and laid out on too broad a basis 
leading often to practical difficulties in actually carrying out the experiment. This may 
lead to a curtailment of the experiment in midstream which may in turn result in an 
unsatisfactory experiment, that is, one that cannot answer the most important question 
or questions that the researcher may have. Obviously, other considerations come into 
play also. These will be outlined in general terms below as a guide to determining 
reasonable strategies for experimental investigations. 

2.2,2 Subject Matter Model 


The first step, statement of the problem, is equivalent to the formulation of questions 
or research and working hypotheses. As mentioned above, such hypotheses must be 
stated as clearly as possible even though the formulation may not be as precise as that 
of a statistical hypothesis. Statements such as “I want to compare treatment (proce¬ 
dure) X with treatment (procedure) F” or “I want to compare several treatments” or 
“I want to find out which factors have an effect on a certain outcome of a process” are 
usually quite appropriate. This, in turn, will lead in an obvious way, to the formulation 
of what we might call a subject matter model. By this we simply mean a listing of all 
the factors that might influence the outcome of the experiment. Such factors will in¬ 
clude the treatment factors which are the main objective of the investigation as well as 
classification (or blocking) factors which are determined by the conditions under which 
the experiment is performed (see also Section 2.2.4). It cannot be emphasized enough 
that such a listing is crucial to the whole investigation for the following reasons: (i) 
It determines, if not completely then to a large extent, the choice of the experimental 
designs, and (ii) it defines the target population with respect to which inferences can be 
drawn. 

We shall illustrate these ideas with the following examples. 


Example 2.1: Suppose we are planning a chronic heart failure randomized clinical 
trial to investigate the effect of carvedilol and metoprolol on the regional vascular re¬ 
sponses to adrenergic stimuli (Hryniewicz et al., 2003). In addition to the treatment 
factor the following factors may need to be considered in deciding on the final trial 
protocol: gender of subjects; type of subjects, e.g. normal subjects, New York Heart 
Association class II, III, IV patients; prior or concurrent type of treatment; age of sub¬ 
jects. □ 
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Example 2.2: In a study to assess the effect of cognitive behavioral therapy for 
nocturnal panic (Craske et al. ， 2005) additional factors to include may be gender, race, 
marital status, level of education, employment status. □ 

EXAMPLE 2.3: The importance of mycorrhizal colonization in the establishment and 
growth of forest trees has long been recognized, and mycorrhizal inoculum is used 
regularly in replanting (Amaranthus et al .， 2005). To further study this effect the fol¬ 
lowing treatment factor maybe considered: Type of ectomycorrhizal fungus inoculum, 
amount of inoculum, type of application. Other classification factors may include: 
Species of trees, age of seedlings at time of inoculation, environmental conditions in 
the greenhouse, type and amount of pesticide application. □ 

EXAMPLE 2.4: A study was conducted to investigate the effects of drinking saline 
water on farmed deer (Kii and Dry den, 2005), In addition to the treatment factor salin¬ 
ity, the following factors may have to be considered: Deer species, gender of deer, age 
of deer, location of deer population, other environmental conditions. □ 

Based upon these factors and possibly others imposed by physical or biological lim¬ 
itations and/or statistical considerations (which will be explained in detail in this book), 
a suitable experimental design has to be chosen together with an appropriate statistical 
model. These two steps go hand in hand and once they are established the course of the 
investigation is pretty well determined and so is the basic statistical analysis. Thus, at 
this point any reconsideration of the experiment, if needed, should take place (indicated 
by the broken lines in Figure 2.1). One can think of a number of reasons why such a 
reevaluation and reformulation of the experiment might become necessary: (i) The ex¬ 
periment, as conceived, has become too big and too complex to be carried out under 
existing conditions, (ii) the physical limitations imposed by the available experimental 
material may make it impossible to obtain any or part of the information sought, and 
(iii) not enough experimental units are available to yield “good” information. 

2.2.3 Three Aspects of Design 

The point we are trying to make here is that this is the appropriate time to think the 
experiment through to its logical conclusion before embarking on it. Of crucial impor¬ 
tance in this respect is the choice of the experimental design which consists of three 
components: (i) treatment design, (ii) error-control design, and (iii) sampling and ob¬ 
servation design. The treatment design determines the treatments to be included in the 
study: which treatments should we choose and how many? The treatments may be de¬ 
termined by various treatment factors and level combinations of such factors. Then the 
questions arise how many factors should be used, how many levels should be included 
for each factor, what is a reasonable range for these levels or what are possible choices 
for these levels. Not only will this depend on whether the factors are qualitative or 
quantitative, but also what kind of information is being sought and how that will be 
reflected in the analysis. It is impossible to give specific guidelines as to how to answer 
these questions. Each experiment has its own characteristics and demands. General 
guidelines will, however, become obvious as we discuss in later chapters more specific 
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designs, both treatment designs and error-control designs. 

Aspects of treatment design are closely connected to aspects of error-control de¬ 
sign. By error-control design we mean the actual arrangement of the treatments in an 
experimental plan using a rule of assigning the treatments to experimental units, that 
is, to pieces of experimental material. Examples of such designs are the completely 
randomized design, randomized (complete or incomplete) block design, Latin square 
design, etc. (see Chapter 3). The choice of an error-control design depends on the 
availability of experimental units, the structure of those units, and the precision of esti¬ 
mation desired by the investigator. For example, if the experimental units have a block 
structure, that is, can be grouped into sets (blocks) of homogeneous units, then some 
form of randomized block design may be called for. Or if the experimental units exhibit 
heterogeneity in two directions (as perhaps in a field trial), some form of row-column 
design (e.g.，Latin square, Youden square) may be the most appropriate design. The 
principle of blocking (see Section 2.5) and the way in which the resulting designs con¬ 
trol the error will be explained in later chapters together with reasons for choosing one 
design over another. 

The third component of the experimental design is the sampling and observation 
design. It determines at which level observations are being taken and what kinds of 
observations are being taken. More precisely it tells us whether the observational units 
are the same as the experimental units or whether subsampling from the experimental 
units is to be done. Also, it specifies whether univariate or multivariate observations 
are to be taken. 


As mentioned before and as indicated in Figure 2.1, the development of the exper¬ 
imental design and the formulation of an appropriate statistical model are intimately 
connected in that the structures of the treatment design, the error-control design, and 
the sampling and observation design determine essentially the complexity of the sta¬ 
tistical model. In the context of this book, the statistical models will be linear models 
or linearized forms of nonlinear models, more specifically classification and regression 
models (see Chapter 4). Since this book is concerned mainly with comparative rather 
than absolute experiments, the linear models used will be classification models incor¬ 
porating the effects associated with the three component designs discussed above. This 
will be made clearer as we discuss the various designs. 

Having chosen a suitable experimental design, the experiment itself can now be 
performed. It is worth noting here that although this part of the whole experimental 
process appears to be nonstatistical in nature, it is crucial that it be carried out in con¬ 
formance with the statistical requirements for the design chosen. This includes, for 
example, proper randomization of the treatments to the experimental units and proper 
replication of the application of the treatments. For this reason it is important for the 
statistician and the investigator to at least work out and write up a protocol spelling out 
all the details of the experiment as far as possible. Included in this should also be de¬ 
tails about the actual data collection and the measurement process, for example, when 
data should be collected and what the scales of measurement are. 
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2.2.4 Modeling the Response 

In the actual experiment each treatment factor is represented by different “levels”，that 
is, different forms or different amounts, such as different types of inoculum，different 
amounts of inoculum. With regard to the other factors in the subject matter model, the 
investigator may decide to restrict a factor to just one level, for example, only russa 
deer, or include several levels, such as patients from different illness severity groups. 
In the latter case these factors will have to be also included in the ensuing analysis of 
the data. For this purpose it is important to provide a suitable model of the response 
data. 

To formalize this idea in general terms (which will be made more specific in later 
chapters) we write 


Response = /(Explanatory variables) + Error. (2.1) 

where / represents an unknown function and the explanatory variables refer to treat¬ 
ment and blocking factors as employed in the treatment and error-control designs, re¬ 
spectively. 

Among the blocking factors are factors identified by the subject matter model (Sec¬ 
tion 2.2.2) as essential for defining the target population for purposes of statistical in¬ 
ference. Cox (1984) referred to these factors as intrinsic factors. We shall adopt his 
terminology and divide the blocking factors into intrinsic and nonspecific factors, the 
latter being determined by the necessities of the error-control design, that is, consid¬ 
erations of further reducing heterogeneity of the experimental material. If we denote 
the set of treatment factors by X = { 工 1 ，工 2 , . • • ； Xt}, the set of intrinsic factors by 
Z = {zi , 2:2 ； • - ； z q }, and the set of nonspecific factors by U = {u\ , U 2 ...., u s }, we 
can rewrite (2.1) more explicitly as 

y = /(xi ， x 2 . zi,z 2 ,... ： z g ; uuu 2 ,. ...u s ) e. (2.2) 

We illustrate the above terminology with the following examples. 

Example 2.5: We consider an experiment reported by Pearce (1953, 1983) (see also 
Hinkelmann, 2004) comparing different pruning managements of pear trees. Combi¬ 
nations of different types and amounts of pruning are assigned to individual trees in 
each of several selected rows of trees. The trees in each row are quite uniform, but 
there exist row-to-row differences due to environmental conditions. For purposes of 
inference several varieties of pear trees were included in the experiment. In this setting 
there are two treatment factors: X\ = type of pruning and X 2 = amount of pruning, 
one intrinsic factor: z\ = variety of pears, and one nonspecific factor: u\ = rows of 
trees. The final experimental design is a factorial treatment design (see Section 11.1) 
in a randomized complete block design (see Sections 9.1 and 9.2). □ 

Example 2.6: The following description of a clinical trial serves as another illustra¬ 
tion. Suppose we want to investigate the effectiveness of different treatments with re¬ 
gard to the elimination of a certain type of skin rash on the human body. The treatment 
factors are x\ = concentration of lotion, X 2 = frequency of application. A combina¬ 
tion of different levels of each factor defines the medical treatment, and each arm of 






36 


CHAPTER 2. PRINCIPLES OF EXPERIMENTAL DESIGN 


each patient included in the trial receives a different treatment. The trial includes male 
and female patients classified according to disease severity. Thus gender and severity 
classes represent the intrinsic factors Zi and 22 , respectively. The patients represent a 
nonspecific factor, u\. The resulting experimental design consists of a factorial treat¬ 
ment design (see Section 11.1) and some form of incomplete block design (see Section 
9.8) as the error-control design. □ 


2.2.5 Choosing the Response 

In the preceding discussion we have used the term “response” in a generic sense. In 
many situations it is actually clear what the response or response variable should be. 
If, for example, we want to determine the effect of different manufacturing processes 
on the strength of a certain type of plastic tube, then the obvious response is measured 
in psi, pounds per square inch, needed to destroy the tube. As another example, if we 
want to assess the effects of different pollutants on a certain crop, it may not be so clear 
what should be measured. We could measure the growth of the plants at the end of 
the trial or at the end of the growth period, or the yield of the plants at the end of the 
growing season, or the amount of damage on the leaves of the plants at the end of the 
trial. 

Our intention here is to point out that the researcher must give careful considera¬ 
tion to the choice of response variable, one that is most meaningful in the context of 
and most clearly associated with the expected outcome of the experiment. In other 
words, the response variable should be chosen so that the inferences and results from 
the experiment can be clearly stated and communicated. In this context, a continuous 
response variable is preferable to a binary or ordinal variable because it contains more 
information. On the same grounds, an objective, that is, measurable variable is prefer¬ 
able to a subjective variable. All this depends, of course, on whether we have a choice 
at all. 

There are still other considerations. For example, should we measure the yield of 
an individual plant or of a bunch of plants? Or, should we measure the pulse rate of an 
individual at a certain point in time or at several time points within a specified period? 
Again, this may be determined by the type of inference we want to make concerning 
the treatments used in the experiment. 

2.2.6 Principles of Analysis 

Once data have been collected they will be subjected to a statistical analysis in con¬ 
cordance with the experimental design and its associated model. Such analyses will be 
dealt with in great detail in later chapters. We shall mention at this point only the basic 
principles involved. 

A major aim of analyzing data from designed experiments is to quantify and eval¬ 
uate the importance of possible sources of variation. This can be achieved through the 
analysis of variance (ANOVA) associated with the underlying linear model, either in 
its univariate or multivariate form. The topic of ANOVA will be taken up in great detail 
in Chapter 4. For purposes of the discussion in this chapter we shall give just a brief 
outline. 
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Given observations, y say, from an experiment, the general idea of ANOVA is to 
partition the “total variability” (or total sum of squares), SS(Total) = E(y — y) 2 , into 
component parts as specified by an underlying linear model. Such a model reflects the 
structure of the observations as determined by the treatment design, the error-control 
design, and the sampling design [see also (2.3)]. Each design is represented by several 
sets of effects (parameters). These effects provide a more explicit expression of model 
(2.2) in the form of so-called main effects of and various interactions among the ex¬ 
planatory variables in (2.2), in addition to one or more error terms (see Section 2.3.2). 
Suppose there are q such sets altogether. Then, using the method of least squares (see 
Chapter 4), SS(Total) is partitioned (not necessarily uniquely) as follows 

SS(Total) = SS(1) + SS(2) + • • • + SS(g), 

where SS(i) represents the sum of squares associated with the ith set of effects (i = 
1.2,... ,q) accounting in some sense for the variation that can be attributed to these 
effects [see also Section 2.9 for a brief discussion of the partition of the total number 
of degrees of freedom (d.f.) into the d.f. associated with the individual SS(z)]. Of 
particular interest and importance in our subsequent discussion will be the sums of 
squares associated with the treatments, and with experimental error. 

The ANOVA provides the basic information necessary for making statistical infer¬ 
ence either in terms of tests of hypotheses (or tests of significance) or confidence inter¬ 
val estimation. Associated with a sum of squares, SS(z), are the d.f. and the mean 
squares, MS(z) = SS(z)/^. It is the form of the expected mean squares, £/[MS(z)], 
which determines, for example, how tests of hypotheses are performed and how error 
variances are estimated. 

All tests of hypotheses (or significance) and estimation of parametric functions are 
done in accordance with the aims of the experiment. Thus the statistical results will 
have to be interpreted in terms of the investigator’s originally formulated hypotheses. 

2.3 THE LINEAR MODEL 

23.1 Three Types of Effects 

In our discussion we have pointed out repeatedly the importance of the linear model 
as it relates to the subject matter model and the experimental design. Such models 
will be given or derived for all designs presented in later chapters, but it seems useful 
to give some heuristic arguments here about the form of these models. The general 
idea is to express the observations, generally denoted by y, in terms of “effects” which 
contribute to y. These effects or components fall basically into three categories: (i) 
treatment effects, (ii) design effects, and (iii) error effects. 

The treatment effects are a reflection of the intervention procedure or treatment 
design. The treatment factors listed in X (see Section 2.2.4) indicate whether a single 
treatment or combinations of several treatment factors are used. Together with subject 
matter considerations this will determine which effects, that is main and interaction 
effects, will be included in the linear model. 
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The design effects are determined by the explanatory variables included in 2. and 
U (see Section 2.2.4). We refer to these effects also as block effects as part of the 
error-control design. 

In addition to pure treatment and design effects we may need to include occasion¬ 
ally treatment x design interaction effects into the model. These arise from possible 
interactions between treatment factors in X and intrinsic factors in Z (see Section 9.6.8, 
also Hinkelmann, 2004). 

Finally, the error effects, or errors for short, represent different kinds of random 
variation. Such variation arises in connection with the experimental and observational 
units (see Section 2.3.2) as well as some aspects of the actual experimentation and 
data collection. Again, these aspects will be discussed in more specific details in the 
following chapters (see, in particular, Section 6.3). 

We illustrate some of the notions discussed above in the following example. 

Example 2.7: The objective of a study by Rosen et al. (2005) was to determine 
whether nitrogen and sulfur fertility affects glucosinolate concentrations in cabbage. 
The treatment factors were x\ = nitrogen = N (at two rates), — sulfur = S (at two 
rates), xs = cultivars (two types: green cabbage and red cabbage). The experiment 
was set up as a so-called split-plot design (see Chapter 13) with four replications in 
two years. Thus z\ = year as an intrinsic factor and u\ = replicate as a nonspecific 
factor. 

The treatment effects included in the model then are: A r rate, 5 rate, N rate x S 
rate interaction, cultivar, cultivar x N rate interaction, cultivar x S rate interaction, 
cultivar x N rate x S rate interaction. The design effects are year effects and replicate 
within year effects. In addition, the treatment x design interaction effects include N 
rate x year interaction, S rate x year interaction, cultivar x year interaction. □ 

2.3.2 Experimental and Observational Units 

In order to understand the nature and use of the error effects or error components it is 
essential to understand the distinction between the (possibly different) units to which 
treatments are applied and on which observations or measurements are being made. 
These units are called experimental units and observational or sampling units, respec¬ 
tively. The experimental unit (EU) is the piece of experimental material, to which a 
treatment is assigned and applied. For example, in a clinical trial where different pa¬ 
tients are given different drugs, each patient is an EU. If, on the other hand, each patient 
is given a different ointment on each arm, then each arm constitutes an EU. Associated 
with an EU is experimental error. Such error is a reflection of the fact that EUs are not 
alike, that is, cannot be replicated exactly. Contributing to experimental error is also 
our failure to replicate a treatment exactly, that is, instead of administering 15 ppm of a 
certain substance, as called for in the protocol, we administer to some units 14 ppm or 
16 ppm and so on. We refer to this component of the experimental error as treatment 
error. 

We emphasize here already that it is very important to always clearly identify the 
experimental units for a given experimental situation. In the case of several treatment 
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Figure 2.2 Schematic representation of experimental layout. 


factors it may happen that different treatment factors are applied independently and 
separately to different types of experimental units. As a consequence there will then be 
also different experimental errors, and that becomes important for tests of significance. 
This is the special characteristic of split-plot type designs (see Chapter 13). 

We shall use the experimental setup of the study in Example 2.7 to illustrate the 
point mentioned above. 

EXAMPLE 2.7 {continued): Here we have two types of experimental units. More specif¬ 
ically we have “large” EUs to which combinations of the two rates of N and S are 
applied, and two “small” EUs within each large EU in each of which one type of cab¬ 
bage is grown. If we denote the two rates of N by n\, 77 - 2 , the two rates of S by si, S 2 , 
and a combination of them by (n“ Sj) (i, jf = 1, 2), then the essence of the experimen¬ 
tal layout for one replicate in one year is illustrated in the schematic representation of 
Figure 2.2, where the open circles O represent green cabbage and the full circles • 
represent red cabbage. □ 

It is important to distinguish between the EU and the observational unit (OU). The 
observational (or sampling) unit is the unit on which observations, that is, measure¬ 
ments, are made. In many situations EU and OU are identical, but in other situations 
they are not. For example, in an educational study a class, that is, a collection of stu¬ 
dents, is the EU as the class as a whole is subjected to a particular teaching method 
which is the treatment. Observations are made, however, on the individual students in 
the form of test scores. Then the students are the OUs. Associated with the OU is an 
observational (or sampling) error, which reflects, among other things, measurement 
error and also sampling error in that if the experiment were repeated, most likely other 
students would be part of the study. 
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To elaborate on this point further we consider again the experiment described in 
Example 2.7. 

EXAMPLE 2.7 (continued): One can think of different scenarios of obtaining data in this 
setting: 

(/) In each row (small EU) we may randomly select one cabbage head and perform 
the appropriate chemical analysis on it. In this case, at least from a statistical 
point of view, there is then no distinction between EU and OU. It illustrates, 
however, the point that there will be sampling error as part of the observational 
error. 

(n) We may randomly harvest two cabbage heads per row, chop them and then com¬ 
bine them in a forced air dryer for further analysis. For purposes of statistical 
analysis the situation is the same as in (i). 

(iii) Two (or more) cabbage heads are randomly harvested from each row and glu- 
cosinate extraction is performed on each head. In this case then the EUs and 
OUs are different: The rows are the EUs and the individual cabbage heads are 
the OUs. This is an example of what is referred to as subsampling (see Sections 
3.5 and 6.9). □ 

2.3.3 Outline of a Model 

In equations (2.1) and (2.2) we have given a very general form of a model for the 
observations from an experiment. The discussion above suggests that the function / 
in (2.1) and (2.2) is a linear function. A formal derivation, based on the notion of 
unit-treatment additivity (see Section 6.3), will be given in later chapters. The idea 
we want to convey here is that the response after the intervention, which, following 
an agronomic practice, is often called yield, is made up additively of a unit effect plus 
a treatment effect plus error effects, e.g., unit or experimental error, observational or 
measurement and sampling error. The unit effects and experimental error effects are 
used to model the systematic and random influences, respectively, of the error-control 
design. Hence we shall refer to the unit effects also as the design effects. They contain 
the effects of the intrinsic and nonspecific factors of (2.2). 

Schematically we write all this as 

Observation = Design effect + Treatment effect 

[+ Design x Treatment interaction effect] ^ ^ 
+ Experimental error 
+ Observational error. 

In (2.3) we have added the design x treatment interaction effect which may arise in 
certain experimental situation as a component of interest. We shall elaborate on this 
model in more detail as we discuss specific examples and different designs — error 
control designs as well as treatment designs — in later chapters. 
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2.4 ILLUSTRATING INDIVIDUAL STEPS: 

STUDY 1 

2.4.1 The Questions and Hypotheses 

We shall now illustrate the various steps we have described in Section 2.2 and the for¬ 
mulation of a linear model as outlined in Section 2.3 in terms of an example. Suppose 
an investigator wants to study the effects of air pollutants on seedlings of loblolly pine. 
The pollutants to be used, singly and in combinations, are O3, ozone, and NO2, nitro¬ 
gen dioxide, at levels .00, .05, .10 ppm for 6 hours/day for 28 consecutive days, applied 
to seedlings at uniform age. The investigator is interested in: (i) comparing the dam¬ 
aging effects of the pollutants and (ii) assessing potential synergistic effects of O3 and 
NO2. These effects are possibly influenced also by the genetic make-up of the trees, 
that is, whether they are relatively susceptible or relatively resistant to air pollutants 
(Kress, Skelly and Hinkelmann, 1982b). More formally, the research hypotheses can 
then be stated as follows: 

(i) Long-term exposure to O 3 and NO 2 has damaging effects on pine seedlings with 
respect to growth, mottling, and chlorotic spot symptoms (see e.g., Kress, Skelly 
and Hinkelmann, 1982a). 

(ii) The amount of damage increases with the level of pollution. 

(iii) A combination of pollutants will exhibit synergistic effects. 

(iv) The amount of damage will also depend on the degree of sensitivity (as deter¬ 
mined genetically) of the family from which the seedlings come, one type of 
family being relatively resistant and one being relatively susceptible. 

The observations and measurements will then be determined and influenced by the 
treatments, that is, the type (combination) of pollutants, the level of pollution, by the 
genetic background, and for growth by initial height. Other factors such as temperature, 
amount of light, and humidity, may have to be taken into account depending on the 
experimental design to be adopted. The seedlings will be exposed to the pollutants in 
pollution chambers. 

2.4.2 The Experiment and a Model 

In what follows we shall use language and terminology which is not very precise and 
intended only to give the reader some feeling for and appreciation of the various con¬ 
cepts we have mentioned so far. More precise formulations will be given in subsequent 
chapters. 

The error-control design depends on the availability and arrangement of the pollu¬ 
tion chambers. Suppose the researcher has 18 chambers available to him distributed 
in a laboratory under uniform environmental conditions, such as heat, light, and hu¬ 
midity. One possible arrangement then would be to randomly assign each treatment, 
that is, each of the nine possible pollution combinations, to two pollution chambers. In 
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This structure, which is also referred to as a factorial structure, enables one to partition 
each effect into 0 3 and N0 2 main effects and O 3 x N0 2 interaction. This leads to 
particular comparisons among the nine pollutants which will enable us to answer, for 
example, the question posed originally whether O 3 and NO 2 if applied jointly exhibit 
synergism. 

Relating model (2.4) to the terms of our discussion in Section 2.2.4 we note that 
O 3 and NO 2 represent treatment factors, type of family represents an intrinsic factor, 
and pollution chamber represents a non-specific factor. 

2.4.3 Analysis 

An outline of the statistical analysis can be exhibited in an analysis of variance table 
as given in Table 2.1 (see Chapter 13). This table indicates, again not in very precise 
terms, which hypotheses can be tested and that these are in agreement with the inves¬ 
tigator^ aims. More specific hypotheses can be tested using follow-up procedures as 
described in Chapter 7. Our main point in all of this is that the experiment is designed 
in such a way that it can provide answers to the questions posed at the outset of the 


each chamber a specified number of seedlings, 25 say, equally divided between the two 
families will be exposed in the prescribed way to the assigned pollutant. We shall refer 
to this as arrangement I. 

A linear model associated with this experimental setup might be as follows: 

= + Fk~\~〈P cijk Tjijkl ， (2.4) 

where yijki denotes an observation for the Ith seedling of the A:th type of family in the 
jth chamber assigned to the ith pollutant, and (i is an overall mean, Pi is the effect of the 
ith pollutant (i = 1.2,... ,9), C 勿 • is the effect of the jth chamber (j = 1,2) assigned 
to the ith pollutant, Fk is the effect associated with the kth type of family (k = 1 , 2 ), 
(PF)ik is an effect due to the interaction (nonadditivity) between the ith pollutant and 
the fcth type of family, and £ijk represents an experimental error component and rjijki 
represents the observational (or sampling) error (l = 1,2,.... 5). We note that the 
experimental error here consists of two components: one component (Qj) arises from 
the application of the pollutants to different chambers (= EU); the other component 
(Sijk) arises in connection with each chamber-family combination as the families, even 
though they are labeled resistant, say, are not identical, that is, not exactly reproducible 
as they may be full-sib families produced from different trees. Model (2.4) can be 
expanded further by making use of the fact that the treatments, that is, pollutants, are 
actually level combinations of two factors, O 3 and N0 2 , as shown below: 
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Table 2.1 Analysis of Variance for pollution 
Arrangement 1 





Research Hypothesis That 

Source of Variation 

d.f. 


Can Be Tested 

Pollutants 


8 

Differences among pollutants [see (i)] 

0 3 


2 

Differences among levels of O 3 , 
averaged over NO 2 [see (ii)] 

no 2 


2 

Differences among levels of NO 2 , 
averaged over O 3 [see (ii)] 

O 3 X NO 2 


4 

Synergism between O 3 and NO 2 
[see (iii)] 

Chambers (Error 1) 


9 


Families 


1 


Pollutants x Families 


8 

Interaction between pollutants and 
families [see (iv)] 

Error 2 


9 


Obs. Error 

36(s — 

i) 


Total 

365 — 

1 



experiment. This, of course, does not imply that this is the only way to achieve these 
objectives. Physical conditions and fiscal considerations may, indeed, dictate another 
course of action as long as it is consistent with the aims of the experiment. Concern¬ 
ing the performance of the experiment, care must be taken that the treatments, that 
is, pollutants, are assigned at random to the pollution chambers, and that the seedlings 
within a chamber are arranged at random or in some rotating fashion for the duration of 
the experiment. An established protocol controlling other “environmental” conditions 
will have to be followed. Attention must be given to the evaluation procedures. For 
example, should foliar symptoms be measured or evaluated on each needle and how, 
or should each seedling as a whole be rated. How should height growth be measured, 
from where to where and during which time period? 

After the appropriate data have been collected they will be analyzed according 
to the model outlined above. It is difficult to draw conclusions in the abstract here 
without any real data, but in light of what has been said before it should be clear how 
the results from this experiment can be interpreted. The question of synergism can be 
answered directly. As a result, it is not difficult to imagine that new questions might be 
raised which then will lead to a new investigation as part of sequential experimentation. 
The crucial point in designing an experiment is to make sure that the investigator’s 
questions can be answered in the context of the statistical analysis. This means that we 
must be able to test, in the analysis of variance table, the statistical hypotheses which 
correspond to the research hypotheses. 
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2,4.4 Alternative Experimental Setup 

To show in terms of the example discussed above how things can go wrong, we con¬ 
sider the following alternative arrangement, referred to as arrangement II. We assign 
each pollutant combination to two chambers with 2$ seedlings of one family in one 
chamber and 2s seedlings of the other family in the other chamber. Expressed alterna¬ 
tively, this means that each combination of pollutants and family is randomly assigned 
to one chamber (this implies that ’’family” is now a treatment factor). The reader should 
recognize that this arrangement is, indeed, different from arrangement I. As a conse¬ 
quence, a different linear model will be used to analyze the data. It can be written in 
the following form: 

Viki = + Pi + Fk {PF)ik + s* k + rjiki , (2.5) 

where all the terms are as defined before with e* k and rj 侃 (l = 1 , 2 ,..., 2 s) repre¬ 
senting experimental and observational errors. An outline of the associated analysis 
of variance is given in Table 2.2. The main result here is that there are zero d.f. for 
experimental error (this follows formally from the position of the total d.f., 36s — 1, 
but also from the fact that each treatment combination is assigned to only one chamber 
(see Section 2.5)) which implies that we cannot test any hypotheses unless we assume 
that all or parts of the interaction between pollutants and families is negligible. That 
assumption, however, is not realistic in light of research hypothesis (iv). Hence this 
arrangement is of no value and should, therefore, not be used. The only reason for 
mentioning this arrangement then is to emphasize the importance of checking whether 
a particular arrangement will lead to a statistical model and hence to an analysis which 
is capable of providing answers to the questions posed by the investigator. 


Table 2.2 Analysis of Variance for Pollution 
Experiment (Arrangement II) 


Source of Variation 

d.f. 

Pollutants 

8 

0 3 

2 

no 2 

2 

0 3 X N0 2 

4 

Families 

1 

Pollutants x Families 

8 

O 3 X Families 

2 

NO 2 乂 Families 

2 

O 3 x NO 2 X Families 

4 

Expt. Error 

0 

Obs. Error 

18(2s-l) 

Total 

365 -1 
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2.5 THREE PRINCIPLES OF 
EXPERIMENTAL DESIGN 

In describing the steps of an experiment we have emphasized the statistical aspects, in 
particular what we have called with reference to Figure 2.1, the “statistical triangle,” 
namely choice of an experimental design, that is, treatment and error-control design ， 
formulation of an appropriate linear model, and outline of the statistical analysis based 
on the chosen experimental design and its associated model. To assure validity of the 
analysis and to increase its sensitivity we have to observe three basic principles which 
are crucial to any experiment. 

The first principle is that of replication. By this we mean that each treatment (or 
some of the treatments) must be applied to several experimental units. In the absence 
of systematic differences among experimental units treated alike, such replications will 
enable us to estimate the experimental (random) error against which differences among 
treatments are judged. (Unreplicated experiments are useful only in certain situations 
under certain assumptions.) 

To ensure validity of the estimate of experimental error we rely on the second prin¬ 
ciple which is that of randomization. It leads to an unbiased estimate of variance as well 
as an unbiased estimate of treatment differences, that is, estimates that are free from 
systematic differences due to otherwise uncontrolled variation. We shall comment on 
the principle of randomization in more detail in Chapter 5 as well as in connection 
with the individual types of designs. We shall point out then how randomization is 
to be performed and how that enables one to formulate appropriate linear models and 
what effect it has on the statistical analysis. 

One of the main objectives in choosing an appropriate error-control design is, in 
fact the reduction of experimental error. In many cases this is achieved by means of the 
third principle, that of local control or blocking. The basic idea is to partition the total 
set of experimental units into subsets (blocks) that are as homogeneous as possible. 
In this way the effects of nuisance factors which contribute systematic variation to 
the differences among experimental units can be eliminated. This in turn will lead 
to a more sensitive analysis since, loosely speaking, the experimental error will be 
evaluated in each block so generated and then pooled over the whole experiment. Such 
blocking (by intrinsic and/or nonspecific factors) can occur in various ways and at 
various stages of the experiment and is dictated by the experimental conditions and the 
requirements on the desired sensitivity of the experiment. The Latin square design and 
the split-plot design are examples of more complicated blocking structures which will 
be discussed in greater detail later in this book. The obvious implication of the present 
discussion is that the more blocking is being done, the more sensitive the experiment 
becomes. This is true, however, only up to a certain point and depends on the amount of 
systematic variability associated with blocking factors，that is to say that it is a function 
of the given experimental situation and the amount of knowledge one has about it. Also, 
it should come as no surprise that increased amounts of blocking will invariably lead 
to more complex experiments, complex from the point of view of execution as well as 
analysis. All this will become clearer later on. 
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2.6 THE STATISTICAL TRIANGLE: STUDY 2 

In Section 2.2 we have outlined in great detail the various steps essential to designed 
investigations. These were outlined schematically in Figure 2.1. We have drawn special 
attention to the intimate relationship between the choice of the experimental design, 
the associated statistical model, and the resulting statistical analysis. With reference to 
Figure 2.1, we have called this the “statistical triangle.” Because of its central role in the 
whole endeavor of scientific experimentation as described in this book, it is important 
that we give some further discussion along these lines so that the reader can develop an 
understanding and appreciation of it. We shall do this in terms of a very simple example 
(Study 2). The idea behind this is to show how, using only heuristic arguments, models 
for the observations (yields) can be formulated which reflect different experimental 
situations. The major point we would like to impress upon the reader is that, although 
all models for the situations described below contain the same components in the form 
of treatment effect, experimental error and observational error, the roles of the two 
error components and their associated mean squares in the ANOVA depend heavily 
and crucially on the underlying experimental plan. 


2.6.1 Statement of the Problem 

Suppose an investigator wants to study and compare the effects of pollutants on pine 
seedlings. In addition to charcoal filtered air (Pi) as the control, he includes the fol¬ 
lowing pollutants: ozone (P 2 ), sulfur dioxide (P 3 ), and nitrogen dioxide (P 4 ). This is 
an exploratory experiment for which he has available four seedlings for each pollutant, 
that is, 16 seedlings altogether. We shall assume that the seedlings are of the same age 
and of uniform height, and that a reasonable fumigation protocol has been established 
and is being carried out properly. The questions we are addressing here are: What are 
some of the alternative designs for this experiment; what are the corresponding linear 
models; how can these experiments be analyzed; and most importantly, to what extent 
can these experiments provide answers to the investigator’s questions? 

2.6.2 Four Experimental Situations 

In Tables 23-2.6 we outline schematically four possible (not necessarily good) exper¬ 
imental plans together with appropriate linear models and an outline of the associated 
analysis of variance table. There are, obviously, other ways of conducting this exper¬ 
iment, but we shall use the four situations given here to point out differences among 
them and their associated models and subsequent analyses. 

EXPERIMENT I ： In experimental situation I (Table 2.3)，four pollution chambers are 
used, each chamber containing four seedlings. The pollutants are randomly assigned 
to the chambers with four seedlings placed in each chamber. Since a particular pol¬ 
lutant is administered to a chamber, the chamber or, alternatively, the collection of 
four seedlings constitutes the experimental unit (EU) whereas each individual seedling 
constitutes the observational (or sampling) unit (OU). As a consequence, the treat¬ 
ment effect and the experimental error are “confounded” with each other, or insepa- 
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Table 2.3 Study 2: Experimental Situation I 


MODEL: 


ANOVA: 


P 2 Pi Ps 



jth observation 
for ith treatment 



(2.6) 


z = t(= 4) 

j = 1 ， 2, …， n(= 4) 


Source 


d.f. £(MS) 


Pollutants + Experimental Error 3 4 + 4 , 丄 | E rf 

Observational Error 12 


rable, which is reflected in the model equation (2.6) in that the treatment effect (r) 
and the experimental error (c) have the same subscript. This, of course, leads to 
the partitioning of the total sum of squares, SS(Total), into only two components, 
SS(Pollutants 4 - Experimental Error) and SS(Observational Error). Their expected 
mean squares, E(MS), which are obtained under assumptions to be discussed in later 
chapters, make it quite obvious that there is no legitimate error term to test hypotheses 
about treatment effects, that is, under the null hypothesis that the treatment effects are 
all identical (and equal to zero)，the two MSs do not have the same expected value. 
From that point of view this experiment is unsatisfactory: It cannot answer the investi¬ 
gator's questions. □ 

EXPERIMENT II: In one sense, experimental situation I represents one extreme situa¬ 
tion, the other extreme occurring in experimental situation II. Here each seedling is 
put into a separate pollution chamber, four of which are randomly assigned to each 
pollutant. Then the EU and OU are identical so that the two associated types of errors 
cannot be separated from each other as indicated in model equation (2.7). Both errors, 
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Table 2.4 Study 2: Experimental Situation II 


Pi 


Pz P2 P2 


Ps P 3 


Pi Pa 






e ij 


ANOVA: 

Source 


d.f. 


P 4 Pi 



(2.7) 

S = 1 ， 2, …， t (= 4) 
j = 2, ..ti( — 4) 

E(MS) 


Pollutants 3 

Error (Experimental + Observational) 12 


^ + I E Ti 

+ 


however, can be separated from the treatment effect and hence tests of hypotheses for 
treatment effects are available (see ANOVA table in Table 2.4). □ 

Experiment hi ： In experimental situation III, two chambers are available for each 
pollutant so that each chamber contains two seedlings. Variation among chambers 
(EU) treated with the same pollutant is then a “measure” of experimental error, whereas 
variation among seedlings (OU) within a chamber is a “measure” of observational or 
sampling error. Not only are both types of errors separable from each other, but also 
from the pollutant effects, which is formally expressed in model equation ( 2 . 8 ) as well 
as in the analysis of variance (see Table 2.5). □ 

Experiment iv : Finally, experimental situation IV represents a variation of situation 
III in that the pollution protocol can be carried out on four pollution chambers with 
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MODEL: 


ANOVA: 


Table 2.5 Study 2: Experimental Situation III 




fcth observation 


Ps 



( 2 . 8 ) 


for jth EU (replicate) 
of ith treatment 

i = 1, 2, ..i (= 4) 
j = 1, 2, ..r (= 2) 
k = 1 ， 2, …， n(= 2 ) 


Source 

d.f. 


£(MS) 

Pollutants 

3 


2 ^e +1E # 

Experimental Error 

4 

4 十 

2 a e 2 

Observational Error 

8 
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Table 2.6 Study 2: Experimental Situation IV 


Pa Pi P 2 巧 



kth observation 
for ith treatment 
in jth block 


Source 


d.f. 


E(MS) 


Pollutants 3 

Blocks 1 

Experimental Error 3 

Observational Error 8 


g + 2cr 孓 +! 


4 + 2〜 2 


MODEL: 


ANOVA: 


句 2 "ST 


(=(=(= 
Jr Jon 




27. PLANNING THE EXPERIMENT: THINGS TO THINK ABOUT 


51 


each pollutant once in the morning (M) and again in the afternoon (A). It is expected 
that because of the diurnal rhythm of plants, there are systematic differences among 
the seedlings in the morning and in the afternoon, that is, time of day represents an 
intrinsic factor. Those systematic differences can be “eliminated” by considering the 
two sets of four chambers each as blocks. Moreover, this arrangement may lead to a 
reduction in experimental error (as indicated by of* instead of in Table 2.6). All ef¬ 
fects are separable [see model equation (2.9)] and hence this is a suitable experimental 
procedure. □ 

Experimental situations I, II, and III are different versions of a completely ran¬ 
domized design (see Chapter 6) and experimental situation IV represents a randomized 
complete block design (see Chapter 9). The reader should notice how these different 
arrangements lead to different models and hence to different analyses. This discussion 
should also help to bring out the point we have made earlier that it is important to 
consider the analysis along with the experimental design to ensure that valid statistical 
inferences can be made. We shall not discuss here which arrangement is best, except to 
say that arrangement I should not be used, but the use of the other arrangements may 
be determined entirely by practical considerations and conditions about which we have 
said nothing here. 

2.7 PLANNING THE EXPERIMENT: 

THINGS TO THINK ABOUT 

In the preceding sections we have discussed various aspects of the design process. We 
have shown how these aspects are interconnected and why it is therefore important 
to approach the planning of an experiment in a careful and systematic fashion. To 
emphasize this point we shall summarize below the important features of the individual 
steps. 

1. Statement of the objective (or objectives): 

At least a general formulation of the problem to be investigated is essential before 
proceeding to the next steps. This is even more important if there are multiple 
objectives which are not to be investigated at the same time. 

2. Formulation of the subject-matter model: 

The important point here is to prepare a list of all factors that potentially affect the 
measured response. This involves choosing the treatment factors and identifying 
possible intrinsic factors. In the interest of keeping the size of the experiment at a 
reasonable level it may be necessary to restrain some intrinsic factors to just one 
level, for example female subjects at one age group rather than male and female 
subjects at different age groups. This will, of course, curtail the applicability of 
the results concerning the treatment effects, that is, narrow the inference space of 
the experiment. Another important - and possibly negative - aspect of narrowing 
the scope of the experiment is the inability of investigating possible interactions 
between treatment and intrinsic factors. 
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3. Choosing factor levels: 

We are concerned here with both, treatment factors and intrinsic factors. And 
when we refer to factor “levels” we mean different expressions of that factor, 
for example, different settings, for instance, 200°C and 300°C, for the factor 
“temperature”，or different therapies, for instance, radiation and chemotherapy, 
for the factor “cancer treatment”. For both, treatment and intrinsic factors, it is 
important to consider carefully how many and which levels we should choose. 
The choice will affect the type and amount of inferential information as well as 
the size of the experiment. 

For a quantitative treatment factor, for example, it may be important to use more 
than two levels to assess any possible curvature in the response function (see 
Section 7.4). Moreover, the levels should be chosen within the practical range 
for the treatment, including lower and upper limits of the range. Furthermore, the 
levels should be chosen far enough apart so that a possible difference in response 
becomes detectable, but not too far apart so that a possible change in response 
at an intermediate level goes undetected. For example, there may not be any 
difference between the temperature factor levels 200°C and 225°C, but 200°C 
and 250°C may be far enough apart for detecting a possible change in response. 
On the other hand, the levels 200°C and 400°C, may be too far apart because 
important changes may occur between 250°C and 300°C. 

Similar considerations hold also for qualitative factors. For the treatment factors 
it is usually quite clear which levels to choose, but for the choice of intrinsic 
factor levels the size of the experiment may become important. For example, in 
order to investigate the effects of different types of pollutants on plants it may 
be appropriate to confine oneself to trees or tree seedlings initially. And rather 
than including in the experiment different species of conifers it may be useful to 
choose one species from coniferous and one from deciduous trees. A subsequent 
experiment may then include other plants, such as different types of vegetables. 

4. Measuring the response: 

The statement of the problem as in 1. above usually not only defines the obser¬ 
vational unit (OU) and the response variable but also the way in which the latter 
should be measured. Such measurements are either continuous or discrete. In 
some situations different types of measurement are possible, and a decision has 
to be made which one should be used. For example, to assess the damage due to 
pollution we may actually measure for each plant the damaged leaf area and the 
total leaf area and then obtain the percentage of damaged leaf area. This may be 
rather cumbersome. Alternatively, we may set up a scoring system, say a 5-point 
score, and then visually assign each plant a score, that is，put it in one of the 
five categories, that best reflects the amount of damage. Clearly, a continuous 
measurement is most informative. To approach such a measurement and keep it 
relatively simple at the same time, we may choose instead of the 5-point system a 
10-point system, say, realizing that such a more differentiated subjective scoring 
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system becomes less repeatable. The other extreme, of course, is a binary sys¬ 
tem - damaged versus non-damaged - which may be too crude to establish any 
differences among the pollutants under investigations. In summary, the choice 
of measurement may be important from a practical as well as from a statistical 
point of view. 

5. Specification of the error-control design: 

Identifying intrinsic factors and including several levels of one or more intrinsic 
factors will already determine to a great extent the type of error-control design 
that needs to be used, for example, some form of block design (see Chapter 9). To 
specify the design more explicitly, we need to identify the EUs and OUs and how 
the treatments are applied to the EUs. This is particularly important if there are 
several treatment factors. There may be different types of EUs (see Example 2.7) 
and hence different error-control designs, for example, block design (Chapter 9) 
versus split-plot design (Chapter 13). 

6. Formulating a model and aspects of the analysis: 

Although we shall discuss these topics extensively in later chapters, it is impor¬ 
tant to point out again that mapping out a model and at least parts of the ensuing 
analysis is a crucial aspect of planning an experiment. These considerations will 
tell us if and how the research hypothesis can be evaluated within a statistical 
framework. Among other things we can identify appropriate error terms to test 
statistical hypotheses or obtain confidence intervals for informative parametric 
functions. We then can assess whether the error terms are based on a sufficient 
number of degrees of freedom (d.f.) (see, for example, Sections 6.8 and 6.9,3). 
In the end these considerations may lead us to conclude that either the experi¬ 
ment as planned is satisfactory or that changes may have to be made to provide 
for a successful experiment. We cannot emphasize enough that the last point 
above represents really the culmination of the planning process. 


2.8 COOPERATION BETWEEN 

SCIENTIST AND STATISTICIAN 

We have just discussed in detail the various steps of a scientific investigation, with 
special emphasis on the planning of an experiment. This process requires a close co¬ 
operation between the subject-matter, scientist/investigator and the statistician. Below 
we shall outline some features of such cooperation, paralleling the points discussed in 
Section 2.7. 

1 . Statement of the objective: 

Research objectives and hypotheses originate in the context of research activities 
within a certain subject-matter field. Thus, formulation of such objectives is 
clearly the primary responsibility of the investigator. It is, however, never too 
early to contact a statistician if experimental work will be involved. The main 
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reason for this is to draw attention to the various steps of planning and executing 
an experiment. 

2 . Formulation of the subject-matter model: 

This aspect, too, is the primary responsibility of the investigator. This is, how¬ 
ever, already a good time for the statistician to raise questions about the desired 
and possible inference space of the results from the contemplated experiment. 
Sometimes “dumb” questions by the statistician will help the researcher to clar¬ 
ify and perhaps modify the aims of the experiment. In particular, considerations 
of potential intrinsic factors will draw attention to the size of the experiment. 

3 . Choosing factor levels: 

We have argued earlier that in order to make an experiment meaningful it is im¬ 
portant to choose the levels of the treatment and intrinsic factors with great care. 
Here again the statistician has to rely on the subject-matter knowledge of the in¬ 
vestigator. It may be desirable from a statistical perspective, for example, to have 
certain level combinations of the treatment factors present in the experiment, but 
such combinations may be undesirable or even impossible for biological, phys¬ 
ical or chemical reasons, or they may be difficult to achieve for purely practical 
reasons. In the end, compromises may have to be made to satisfy both statis¬ 
tical and subject-matter considerations without sacrificing the objectives of the 
experiment. 

4 : , . Measuring the response: 

Not many experiments are conducted in a complete vacuum, that is similar ex¬ 
periments have been performed previously. As a consequence, procedures have 
been agreed upon now to measure the response to treatments. It is generally 
desirable to conform to such procedures in order to make it possible to make 
comparisons among the outcomes of different experiments. Precedent and new 
ways to look upon the results of an experiment may, in fact, lead to using differ¬ 
ent response measures. This may have to be decided on practical and economical 
grounds. 

5 . Specification of the error-control design: 

This aspect of the experiment requires a close collaboration between the inves¬ 
tigator and the statistician. Here, questions have to be settled as to how the 
experiment should actually be performed. At this point a number of questions 
have to be answered: What are the experimental units (EU)? What are the obser¬ 
vational units (OU)? How homogeneous are the EUs? Can and should they be 
divided into more homogeneous groups (blocks)? Will the experiment be per¬ 
formed at different stages, that is at different times or different places? How will 
the treatments be assigned to the EUs? Will there be different such assignments? 
Answers to these and perhaps additional questions will help in selecting one of 
the error-control designs discussed in later chapters, or help in modifying one of 
those error-control designs in accordance with the needs of the experiment. An 
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important aspect of these considerations here is the identification of non-specific 
factors (see Section 2.2.4) in addition to the already chosen intrinsic factors. 

Representing one vertex of the statistical triangle of Figure 2.1, developing the 
error-control design sets the stage for actually performing the experiment. There¬ 
fore, there must be complete agreement between the investigator and the statis¬ 
tician on all points of the design. In order to facilitate communication between 
both sides and avoid misunderstandings, we strongly recommend to draw a dia¬ 
gram which represents a schematic picture of the physical layout of the experi¬ 
ment, similar to those in Tables 2.3 - 2.6. 

6 • Formulating a model and aspects of the analysis: 

The model for analyzing the data to be obtained from the experiment is deter¬ 
mined in large measure by the treatment and error-control designs, aspects of 
which we have discussed above. In fact, for each error-control design we shall 
show in later chapters how a linear model can be derived, and what assumptions 
have to be made. Such assumptions may involve the nature of certain interac¬ 
tions. Subject-matter knowledge can be of great help in deciding, in particular, 
which treatment-intrinsic factor interactions may be negligible. 

We cannot overemphasize enough how important it is to give careful thought to 
the basic elements of the statistical analysis and how the various elements relate 
to the various aspects of the research hypotheses. Hinkelmann (1963) describes 
an example where from what appeared to be a perfectly logical experimental 
setup (albeit different from the designs discussed in this book), not all of the 
researcher’s questions could be answered and how the situation could have been 
rescued if the analysis had been anticipated. 

7 . Performing the experiment: 

Although the investigator is responsible for performing the experiment, ideally 
the assisting statistician should be involved, too. Both should make sure that the 
agreed upon experimental protocol is being followed. For example, it is impor¬ 
tant to carry out the appropriate treatment randomization in order to avoid bias 
or confounding. Also, if it turns out that, in spite of careful planning, the exper¬ 
iment cannot be performed in its original form, ways will have to be found to 
modify or curtail the experiment such that most，if not all, of the original ques¬ 
tions can still be answered. An arbitrary curtailment without close consultation 
between investigator and statistician can lead to undesirable consequences and, 
indeed, failure of the experiment. 

S . Collecting and recording data: 

This is the final step prior to the formal analysis of the data. Not only is it im¬ 
portant to collect the data as carefully and completely as possible, following the 
established protocol, but also to label and organize them in close cooperation in 
order to facilitate the analysis, typically using some statistical software program. 
Although so-called missing observations can be handled in many situations by 
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statistical software packages, this does not provide a license for carelessness, 
since this may lead to needless complications in the analysis and its interpreta¬ 
tion. 

2.9 GENERAL PRINCIPLE OF INFERENCE 
AND TYPES OF STATISTICAL ANALYSES 

Our discussion in previous sections has made it clear that the analysis of data obtained 
from a designed experiment depends very heavily on a linear model which should 
reflect the structure of the experiment itself. In fact, formulation of such a linear model 
is a very important aspect in this whole endeavor and we shall return to this problem 
throughout the book. Once an appropriate linear model has been formulated, the next 
step will be to obtain the associated analysis of variance or, as the case may be, several 
analyses of variance. These then provide the basis for making statistical inferences as 
deemed appropriate for the particular situation at hand. 

2.9.1 General Model 

As mentioned earlier, a linear model for an experimental design contains generally 
three types of components: treatment components, design components, and error com¬ 
ponents. A linear model can be written more formally than (2.2) and 2.3) as follows: 

t b c d 

Y = 3" + Z Z U J 0 J + ^ Z k s k + ^ W 呷 (2.10) 

i=l 3=1 k=l 1=1 

where Y represents an 5 x 1 vector of observations, /x is an overall mean, and 
Ti = (rii, Ti 2 5 ..., TiatY is an ai x 1 vector of “treatment effects”（i = 1,2,..., t), 
f3j = (pji. 0 j 2 i …， PjbjY is a x 1 vector of “blocking effects”（j = 1,2...., b), 
Sk = .. • ， ^kc k ) x is a ca ： x 1 vector of experimental errors (k = 1.2,..., c), 

rfi = , rji^Y is a d/ x 1 vector of observational errors (l = 1,2...., d), 

0 is an 5 x 1 vector of unity elements, U) ， Z^, are known matrices of 
order s x ai(i = 1 ， 2, ... ， t)，s x bj (j = 1 ， 2,… ， 6 )， s x Ck{k = 1 ， 2,… ， c )， 
sxdi(l = 1,2,..., d), respectively. The matrices represent the treatment structure, 
e.g.，treatment factors and their interactions (and possibly treatment-intrinsic factor in¬ 
teractions), whereas the matrices JJj reflect the error-control design aspects, that is, the 
various blocking devices as suggested by the intrinsic and non-specific factors, and the 
matrices and reflect the error structure which is partly induced by the block¬ 
ing devices and various stages of randomization as well as the nature of EUs and OUs 
and the various types of errors associated with them and with the measurement and 
observation process. 

2.9.2 Outline of the ANOVA 

Based upon a model of the form (2.8) we can outline, albeit in not very precise terms, 
the general structure of the analysis of variance as given in Table 2.7. The basic parti- 
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tion of SS(Total) is into SS(Among EUs) and SS(Within EUs) with m — 1 and s — m 
d.f., respectively, assuming that there are m EUs. The SS(Among EUs) can then be 
partitioned further into SS(Among Treatments) with t. — ^2 ^ d.f. and SS(Among 
EUs Within Treatments) with m — t. d.f. Further partitioning of SS(Among Treat¬ 
ments) is possible and sometimes desirable, for example, when one is interested in 
testing hypotheses about certain treatment contrasts or when the treatments have a fac¬ 
torial structure. Thus, such partitioning is determined largely by the treatment design. 
The partitioning of the SS(Among EUs Within Treatments) is determined by the error- 
control design, which leads to various sums of squares associated with blocking factors 
and associated experimental errors as a function of the different randomizations. The 
different SS (Experimental Error) will, of course, be used to make statistical inferences 
about the treatment effects (examples of this will be provided later in the book). Finally, 
the partitioning of the SS(Within EUs) is determined by the various types of sampling 
and sub-sampling, that is, by the observational structure. 

We shall illustrate this general discussion with Study 1 (Arrangement I) given in 
Section 2.4.2. Model equation (2.4) can be expressed in the form (2.10) by way of the 
following correspondences: The fact that with respect to the pollutants the chambers 


Model (2.2) 

Model (2.8) 
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are the EUs implies that SS(s ： i) provides the appropriate error term (denoted as Error 1 
in Table 2.1) for testing hypotheses about pollutant effects. SS(^ 2 )，on the other hand, 
provides the error term (denoted by Error 2 in Table 2.1) for testing hypotheses about 
pollutant x family interaction effects. A sampling error is provided by SSir]^. 

As illustrated above, one important feature of the analysis of variance is the sepa¬ 
ration of systematic effects such as treatment and block effects from random or error 
effects. This is not only important in the context of hypothesis testing but also for estab¬ 
lishing confidence intervals and obtaining standard errors for treatment comparisons. 
Together with model equations of the form (2.10) and properties of (or assumptions 
about) the error components, the analy sis of variance of a properly designed experiment 
enables us to estimate the variance components , a^ 2 .... , a^ c and , cr^ 2 ， … ， a^ d 
(or linear functions of them) which can then be used as mentioned above. Knowledge 
about these variance components is quite often useful also to establish further experi¬ 
mental strategies such as determining the appropriate numbers of replications for each 
treatment and the amount of sampling within EUs. To summarize, statistical inference 
from experimental data，whether it is in the form of testing or estimation, is based on 
an underlying linear model. The method of least squares (Chapter 4) is then used to 
obtain estimates of pertinent parameters as well as the analysis of variance table. In all 
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Table 2.7 General Structure of Analysis of Variance 
for Model (2.10) 
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this, elements of randomization theory will play an important part. 


2.10 OTHER CONSIDERATIONS FOR 
EXPERIMENTAL DESIGNS 

Our main emphasis in this general discussion so far has been that an experimental 
design, consisting of a treatment design, an error-control design, and an observation 
(sampling) design, must be chosen in such a way that the investigator’s questions can 
be answered. In this connection we have elaborated upon the connection between the 
design and the associated statistical analysis. Now, in many situations a particular 
scientific objective can be met by different types of experiments. An example of this 
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was given in Section 2.6. In such a case an obvious question is: Which experiment 
setup should we choose? Or what is the “best” experimental design for this situation? 

Although the question is straightforward, the answer may not always be easy as 
different criteria have been developed to compare competing designs. We shall mention 
briefly some of these criteria. Some of them will be discussed in more detail later. 

One of the most important criteria is that of optimality or better, variance-optimality. 
By this we mean maximum precision (in some sense) in estimating linear combinations 
of treatment effects. Usually a functional of all such variances is minimized which has 
led to various optimality criteria, such as ^4-optimality, D-optimality, or ^-optimality, 
(for example, Kiefer, 1959) and it is not always clear which is the best criterion to use 
(see II. 1.13). 

A useful property for a design is that of orthogonality. It allows one to look simply 
at treatment means for purposes of comparisons. It also leads to a unique analysis of 
variance the sums of squares of which can be computed easily. All this may not seem 
very important in these days of high speed electronic computers. It does, however, 
make the interpretation of results easier and more transparent. 

Many of the existing and most commonly used designs are orthogonal, but if or¬ 
thogonality cannot be achieved, the property of balancedness is often sought. Here 
we are referring to variance-balanced designs in the sense that normalized treatment 
comparisons are estimated with the same precision. Other notions of balance exist, 
particularly in the context of factorial experiments, which are important in the whole 
discussion of experimental design (for example, Yates, 1935,1937; Shah, 1958; Preece, 
1982; and II. 12.5). 

There are many other criteria and properties that we could mention here such as 
connectedness, efficiency, and unbiasedness (see, for example, Federer, 1984), but we 
shall defer these to later chapters when the need for them will become more apparent. 

As pointed out earlier, one of the major objectives in designing an experiment is 
to estimate comparisons among treatment effects as precisely as possible. This can 
be achieved in a number of ways such as replication and blocking or refinement of 
experimental and measurement techniques. One other important device is to use sup¬ 
plementary information in the form of measurements on the OUs which are correlated 
with the final responses and not affected by the treatments applied to the units. This 
has the effect of “making the EUs more uniform” and hence of reducing the variabil¬ 
ity. The statistical technique to be used for this situation is the so-called analysis of 
covariance (see Chapter 8). It can be used in connection with all types of experimental 
designs. 

Supplementary information may not always be available and if available it may not 
always be advantageous or it may be too expensive to obtain. It is quite clear that in 
many instances of designed experiments cost considerations come into the picture. Un¬ 
fortunately, there is very little of concrete advice that we can offer the reader. Related 
to this problem we only make one point here: to keep the design as simple as possible 
as long as it is consonant with the investigator’s objectives. Simplicity is a requirement 
that affects the execution of the experiment as well as the analysis and the interpreta¬ 
tion of results. But simplicity is not an absolute term. The simplest experiment for a 
given situation may be rather complex indeed. 

The notion of simplicity is also tied to another concept which is perhaps the most 
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important among the ones that we have mentioned: range of validity or target popu¬ 
lation. If the range of validity is very narrow the experimental design can be rather 
simple. To give an example, if we want to investigate the effect of ozone at 5 ppm over 
a period of 30 days at 6 hours/day exposure on loblolly pine seedlings at age 20 days ， 
the target population is rather small and a completely randomized design (similar to the 
one described in Section 2.6) would be an appropriate design. If, however, we would 
like to extend the range of validity to natural forests and various forms of pollution, 
both the treatment design and the error-control design would have to be much more 
complicated, if indeed it can be done at all without making simplifying assumptions or 
limiting the target population. 

The reader will realize that the preceding discussion has not been exhaustive but 
rather sketchy. The purpose has been to point out the many facets, statistical and non- 
statistical, that one must be aware of when designing an experiment. Furthermore, it 
shows that it is not always possible to meet all the requirements and criteria and that, in 
fact, some are in conflict with each other so that compromises have to be made. Even 
though the basic ideas and principles of experimental design are well understood and 
appreciated, new criteria are being constantly developed (see, for example, Srivastava, 
1984) and need be incorporated in this field. 



CHAPTER 3 

Survey of Experimental Designs 
and Analyses: 

A Preview 


3.1 INTRODUCTION 


In the preceding chapter we have discussed, in general terms, the basic ideas and steps 
of scientific experimentation, how simple questions and speculations together with 
knowledge of the subject matter should eventually guide the investigator to a designed 
experiment which is based on sound statistical principles. We have touched on the 
major principles of randomization, replication, and blocking and their functions with 
respect to designing and analyzing an experiment. We shall pursue these ideas in much 
more detail, of course, as we discuss various forms of statistical designs in subsequent 
chapters. Our major aim in Volume I is to acquaint the reader, first of all, with a broad 
variety of error-control designs, treatment designs, and sampling designs so that, given 
a certain experimental situation, he or she can make a choice among various options, 
and make that choice intelligently. This means the reader must understand the proper¬ 
ties of the various designs, how they can be used to answer the researcher’s questions 
and how a choice of a design from among the possible ones will affect the answer. 

The following overview of statistical designs is intended to provide a catalog of de¬ 
signs to be discussed in Volumes I and II and also to describe, albeit somewhat superfi¬ 
cially at this point, the hierarchy of error-control designs in terms of their complexity, 
the nature of treatment designs, the connection between certain types of error-control 
and treatment designs, and finally, the sublety of sampling designs. We shall do this 
in a somewhat schematic way showing the progression from simple to more complex 
designs. It is useful to keep this in mind as one chooses an appropriate design, be it 
an error-control or a treatment design, because the choice of a design is often made 
difficult by conflicting ideas and principles. On the one hand one would like the design 
to be as simple as possible, mainly for practical reasons, but on the other hand one 
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would like to account for as many sources of systematic variation as possible, that is, 
use a more complex design, mainly for reasons of statistical inference. A compromise 
is often the final result. 

We shall conclude this chapter with some remarks about analyzing experimental 
data. 


3.2 ERROR-CONTROL DESIGNS 

Table 3.1 gives a list of classes of error-control designs in increasing order of com¬ 
plexity, where complexity is defined by the number of blocking factors for each class. 
The blocking factors correspond to different sources of systematic variation. The sense 
then in which the designs “control” the error is through the amount of blocking. Elim¬ 
inating, that is, blocking for, additional sources of systematic variation (using some 
intrinsic and/or non-specific factors) will lead to a reduction of the experimental error. 

The simplest design is the completely randomized design (see Chapter 6) with no 
blocking factors, that is, assuming essentially homogeneous experimental material (ex¬ 
perimental units). Next, a rather large class of designs is that of randomized block de¬ 
signs (see Chapter 9). As the name indicates for these designs we have one type of 
blocking, such as different litters, different breeds, different species, different sources 
of raw material, different manufacturers, and so on. The specific designs in this class 
are the complete block design, the generalized block design, and various forms of in¬ 
complete block designs which are characterized by the fact that each treatment occurs 
exactly once in each block, several times in each block, or not in every block, respec¬ 
tively. The complete and generalized block designs have a very simple structure and 
are easy to analyze and interpret. The incomplete block designs have a more intricate 
structure. Because of the fact that not every treatment occurs in every block these de¬ 
signs constitute what we shall refer to as nonorthogonal designs. Historically that has 
meant a more complicated analysis (for instance, various forms of analysis of variance) 
but in today’s computing environment that is no longer true. Nevertheless, these de¬ 
signs which are very versatile and flexible deserve special attention and although they 
are introduced briefly in Chapter 9, a much more detailed technical discussion of the 
properties, analysis, and construction is given in Volume II. 

The use of intrinsic factors will introduce additional blocking, leading to designs 
that we shall refer to as replicated randomized block designs. As we have already al¬ 
luded to earlier an important feature of these designs is that they allow the investigation 
of possible interaction between treatments and some or all intrinsic factors. 

An important principle in the design of experiments is that of a Latin square. In its 
simplest form this isditxt row-column array such that every one of t symbols appears 
exactly once in each row and in each column. In the context of experimental design, 
the rows and columns refer to two different blocking factors and the t symbols refer 
to the treatments (see Chapter 10). Thus, compared to the randomized block designs, 
we have one additional blocking factor. Furthermore, the structure obtained through 
the Latin square principle is such that the blocking (in two directions) is orthogonal, 
resulting again in a very simple analysis. The general class of Latin square type designs 
contains several specific designs, such as the Latin square design and the Latin rectan- 
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Table 3.1 Hierarchy of Error-Control Designs 


Number of 

Blocking 

Factors 

Class of Designs 

Specific Designs 

0 

Completely Randomized 
Designs 


1 

Randomized Block Designs 

Randomized Complete Block Design 
Generalized Randomized Block 

Design 

Incomplete Block Designs: 

Balanced Incomplete Block 

Design 

Balanced Treatment Incomplete 
Block Design 

Partially Balanced Incomplete 

Block Design 

Lattice Design 

Extended Block Design 

Trend-free Block Design 

>2 

Replicated Randomized 

Block Designs 


2 

Latin Square Type Designs 

Latin Square Design 

Latin Rectangle 

Incomplete Latin Square Design 
(Youden Square) 

Cross-over Design 

>3 

Replicated Latin Square 
Designs 


3 

Grseco-Latin Square Designs 


>3 

Mutually Orthogonal Latin 
Squares 
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gle design, which are, except for the degree of blocking, comparable to the complete 
and generalized block designs, respectively. Corresponding to the incomplete block 
designs, we now have incomplete Latin square designs where, as the name implies, the 
Latin square principle is not completely satisfied because, for example, the number of 
columns is less than the number of rows and treatments. This makes the requirement 
for the design less rigid, but it makes the analysis slightly more complicated. This is a 
reoccurring theme: We “gain” something on the one hand, but “lose” something on the 
other hand. 

The usefulness of Latin square designs can be enhanced through replications of the 
basic design. This enables us to take one more blocking factor into account which, of 
course, widens the inferential basis for the experimental results. Other extensions of the 
Latin square design using more than two blocking factors lead to designs in the form 
of mutually orthogonal Latin squares. An example of this is the Grceco Latin square 
design with three orthogonal blocking factors. Just as for the Latin square design, 
replications of the basic design will make them more useful. 

3.3 TREATMENT DESIGNS 

Each of the error-control designs mentioned in the previous section is used to compare 
t treatments with each other. So far we have not said anything about the nature of the 
treatments, and it is indeed not necessary to do so. Very often, however, the treatments 
are chosen to have some structure, in particular a factorial structure. This is what we 
have referred to as the treatment design. Just as the error-control design, the treatment 
design has to be chosen by the experimenter based upon the goals of the investigation 
and the experimental material and resources available. The chosen treatment design 
will then be embedded into an appropriate error-control design. 

For factorial treatment structures, we distinguish between symmetrical (pure) facto¬ 
rial structures (also referred to as symmetrical (pure) factorial experiments) and asym¬ 
metrical (mixed) factorial structures or experiments (see Chapter 11). For the symmet¬ 
rical structure, we have n factors each at s levels, say, where s is an integer. This is also 
referred to as an s n factorial. The most useful and practical values for s are 2, 3, and 4. 
For the asymmetrical factorial structure we have n\ factors at s\ levels, n 2 factors at S 2 
levels,.. • ， n m factors at s m levels, where the Si (i = 1,2,..., m) are different integers 
and > 1 (z = 1.2...., m). We refer to this as an s 7 ^ 1 x sg 2 x •. • x factorial. 
An important property of any factorial experiment is that it allows one to study not 
only the effects of the individual treatment factors, but also the interactions between 
treatment factors. The usefulness of factorial experiments rests, however, upon the fact 
that, typically, interactions involving several factors are nonexistent or negligible from 
a practical point of view. 

This observation is of great value in that it allows us often to reduce the size of 
the experiment by considering only a fraction of all possible treatment (level) combi¬ 
nations. Such an experiment is referred to as a fractional factorial. For 2 n , 3 n , and 
2 m x 3 n factorials we discuss the basic ideas of a fraction briefly in Chapter 11, but 
general methods of constructing various types of fractional factorials are discussed ex¬ 
tensively in Volume II (see II. 13 - 16). The difficulty in choosing appropriate fractions 
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is to ensure that essential information about interactions is not being lost. 

A schematic overview of these treatment designs and their hierarchy is given in 
Table 3,2. ^ ^ 

3.4 COMBINING IDEAS FROM ERROR- 
CONTROL AND TREATMENT DESIGNS 


It is important to be aware of and understand the properties and purpose of existing 
error-control and treatment designs in order to make appropriate choices fora particular 
experiment. But just as these two design aspects are important in and by themselves, it 
is imperative to understand how error-control designs and treatment designs have influ¬ 
enced each other in generating special error-control designs in particular for factorial 
experiments. Especially noteworthy here are incomplete block designs for complete 
factorial or fractional factorial experiments. A brief introduction for 2 n factorial exper¬ 
iments is given in Chapter 11, but the more technical and detailed discussion is deferred 
to Volume II (see II. 8-12, 13.8). Suffice it to say here only that these designs are con¬ 
structed by making use of the notion, mentioned above, that certain interactions among 
treatment factors are negligible and hence information on them can be sacrificed. 

Other examples of the interplay between error-control and treatment design are the 
various forms of split-plot type designs (see Chapter 13). This is a large class of designs 
in which the treatments have a factorial structure, typically with two or three factors. 
The essential feature of these designs is that the levels of the various factors are applied 
independently (using independent randomizations) to different types of experimental 
units by superimposing different error-control designs upon each other. For example, 
in a simple split-plot design the levels of one factor are applied to “large” experimental 
units in a randomized complete block design, and the levels of another factor are ap¬ 
plied to “smaller” experimental units in a randomized complete block design with the 
large units representing the blocks, that is, the experimental units for the first factor are 
split into experimental units for the second factor (hence the name split-plot). Many 
variations and extensions of this principle exist and are discussed in Chapter 13. As a 
special case, this contains also so-called repeated measures designs (see Chapter 14). 

In Chapter 12, we give a brief introduction to response surface designs. And even 
though these designs are not intended for comparative experiments but rather for ab¬ 
solute experiments, the notions of treatment design, that is, factorial experiment, and 
error-control design, that is, blocking, play a prominent role. As one interesting ex¬ 
ample of the simultaneous use of error-control and treatment design we mention the 
so-called Box-Behnken designs, the construction of which is based on essentially su¬ 
perimposing a factorial structure over an incomplete block design. 

And finally, we mention a class of designs which are constructed by using the 
notion of pseudo-factors, that is, by pretending that a factorial structure exists for the 
treatments when in fact it does not, to construct certain types of incomplete block 
designs for a large number t of treatments, where t is of the form; t = k 2 or t = k 3 or 
t = k(k — 1), etc., for some integer k. These are referred to as Lattice designs and are 
of practical value for agronomic experiments (see II. 18). 
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3.5 SAMPLING DESIGNS 

In Section 2.3.2, we have discussed the importance of the notions of experimental units, 
EU, and observational (or sampling) units, OU. In many experimental situations, the 
EU and OU are identical. However, if they are not the same, then this needs to be 
recognized and reflected in the analysis (an example of this is given in Section 2.6.2). 
Such a situation is referred to as subsampling and it can occur in connection with any 
error-control design. We shall discuss the consequences of subsampling in detail for 
the completely randomized design (see Section 6.9). The same arguments apply then 
to all other error-control designs discussed in this book. 

The most important feature of an error-control design with subsampling is that it 
allows the separation of experimental error and observational (or sampling) error, or 
more precisely, the separation, that is, separate estimation of the experimental error 
variance and the observational error variance. We shall discuss the statistical impli¬ 
cations of this fact in connection with making inference about treatment effects and 
comparisons among treatment effects. The possibility of being able to estimate the two 
types of variances may prove to be useful to the investigator in assessing the “quality” 
of the experimental and observational (measurement) procedures. Large variances may 
lead to a closer look at and, hence, to possible refinements of one or the other or both 
procedures. 

The notion of subsampling can obviously be extended to more than one level, for 
example, for each experimental unit we may have several sampling units and then for 
each sampling unit we may have several observational units. We refer to this situa¬ 
tion as sub-subsampling for obvious reasons. As an example, consider an individual 
as the experimental unit receiving a particular treatment; several blood samples, con¬ 
stituting the sampling units, are taken at one time from this individual, and duplicate 
determinations of, say，the blood sugar level are made, each determination represent¬ 
ing an observational unit. Such a scheme would enable the investigator to assess the 
variability due to three sources: experimental, sampling, and observational. 

Theoretically this can be extended even further, but this does not provide any new 
insight from the point of view of experimental design as discussed in this book. 

A schematic representation of the sampling designs described above is given in 
Table 3.3. 

3.6 ANALYSIS AND STATISTICAL 
SOFTWARE 

Following chapters will show that the notion of randomization is not only fundamental 
to physically performing the experiment but also to analyzing the data from such an ex¬ 
periment. This is accomplished by introducing the notion of design random variables 
which are then used to obtain a derived linear model which reflects the randomiza¬ 
tion procedure used for a particular error-control design. Such a model and the ensuing 
analysis of variance will then be used to formulate the randomization analysis due to R. 
A. Fisher (1926, 1935). This is a nonparametric analysis and, hence, does not depend 
on the often quoted normality assumption for experimental data. 
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Although we advocate the randomization analysis as the proper analysis, it is of¬ 
ten met with practical difficulties, because for most situations the number of possible 
randomizations becomes extremely large. We shall show how the randomization anal¬ 
ysis, that is, randomization tests, can be approximated by appropriate F-tests. At this 
point we shall then make use of existing statistical software for purposes of analysis, 
always keeping in mind, however, that this represents an approximation only, albeit 
a ’’good” approximation. Among the available statistical software packages we have 
chosen SAS®, a Statistical Analysis System (SAS Institute, Inc., 2002-2003), the use 
of which will be illustrated through numerous examples. 


3.7 SUMMARY 

The preceding discussion and enumeration of classes of error-control and treatment de¬ 
signs and combinations thereof by no means exhaust the list of available designs. Many 
speciality designs have been constructed and it would be impossible to list and discuss 
them all. We have, however, mentioned and we shall discuss in subsequent chapters 
the major classes of designs and their properties, how they are constructed, how they 
are analyzed, and how they are applied. The major point here is that for many experi¬ 
mental situations special designs may have to be constructed and that this can be done 
easily by having a firm understanding of the notions of blocking, incomplete blocks, 
the Latin square principle, the split-unit (split-plot) principle, and factorial treatment 
structure. They are the building blocks of almost all designs, and it is the aim of this 
book to elucidate them in a rigorous way, emphasizing the mathematical and statistical 
aspects. 
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CHAPTER 4 


Linear Model Theory 

4.1 INTRODUCTION 

4.1.1 The Concept of a Model 

The greatest intellectual achievement, perhaps, of the twentieth century has been the 
development of the concept of model and the use of that concept. A model is an 
explanation of observables in terms of observables. Explanations are of various types. 
The most simple is, surely, the notion of descriptive explanation; so, for instance, to use 
an ancient statistical example, height and weight of human adults of one or the other sex 
follow approximately a bivariate normal distribution. Or at an even more elementary 
level, with s denoting distance and t being time then with certain units of measurement, 
s = \gt 2 , where p is a constant of gravitivity. Models can be static in the sense that 
they describe a situation. They can be dynamic in the sense that they tell us, given the 
truth of the model, what will happen in, say, the future, the prime example being those 
arising from differential equations, such as dy/dt = at + b, where t indicates time. 
The whole area of differential equations and partial differential equations is concerned 
with what may be reasonably called dynamic models so that from a starting point, the 
differential equation, one can, hopefully, obtain the solution which tells us the outcome 
over the relevant space, for instance, physical space and time. Models can be classified 
in another way as being merely explanatory or causal. If we envisage a variable y 
as being affected by a variable x, and we can, furthermore, envisage a comparative 
experiment in which the variable x is controllable and is observed at various prechosen 
levels, then we can reasonably regard a resulting explanation, y — f{x), where f(x), as 
a special case of (2.2), is some function such as ax, £n x, sin x y or whatever, as being 
a causal explanation or a causal model. Clearly, the imputation of the explanation 
or model having a causal basis must in the last resort have an experimental, that is, 
interventional, basis. 

An approximate model is one in which a variable (of arbitrarily general form) is 
approximated by a function of variables deemed appropriate, or merely being available. 
So, for instance, we may have the variable y denoting the weight of a child, and we 
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have a mathematical formula 


y = 80 + 0ix-r ( 3 2 z 

in which we approximately describe y in terms of two variables, x = sex, and z = age. 
As we shall see later, a very common situation is that we have a vector variable y, say 
n x 1， which we wish, for one reason or another to describe approximately in terms of 
other vector variables, xi, X 2 , … ， x p , and we seek a description 

y = Axi + 3 2 x 2 + — h Ppx p . 


In contrast to the notion of the previous paragraph which is solely in the realm of 
approximation and approximation theory, we have the notion of a stochastic model. 
In this case we have, for example, a random variable, Y, which has a probability dis¬ 
tribution, and we wish to describe the distribution of Y ; for instance, that Y follows 
A r (/i, a 2 ), the normal distribution with mean g and variance a 2 . There is, of course, no 
limit to the nature and complexity of such stochastic models. One that is surely easy 
to imagine and think about is that we have a sequence of random variables Yi, > 2 . ••• 
and we wish to characterize in some way the joint distribution of the random variables. 
In such problems, we may have some variables that are considered, in our proposed 
explanation, to be predetermined, or exogenous, the variables for which we need an 
explanation being called endogenous. In the case of stochastic models general develop¬ 
ment involves two entirely different directions, with different mathematical techniques. 
It may be that we have a basic probability model that is specified mathematically, and 
we then have to derive by mathematical reasoning the probability distribution of vari¬ 
ables that result from numerical processes from the variables in the basic and given 
probability model. All this leads, of course, and as many readers will realize, to the 
whole realm of probability theory, stochastic processes, and so on. This is one of the 
two directions and it requires certain easy and many not-so-easy types of mathematical 
analysis and technique. The other direction is that we have to envisage, by one route or 
another (perhaps rather naive and perhaps very sophisticated mathematically) a prob¬ 
ability distribution, perhaps specified only to a partial extent, or perhaps completely. 
Then our task is to do one or both of the following: (a) Obtain observations accord¬ 
ing to some investigative plan (or even with no plan at all, except that we accept what 
our observation process gives) and then (b) follow procedures of statistical analysis to 
estimate, that is, form judgments, preferably, in objective ways, of aspects of the prob¬ 
ability distributions, with the assumptions that our given data comprise realizations of 
random variables. 

What we have described above, characterizes, in a sense, all that goes on in all 
branches of science, though what we have given is a short picture that would require 
huge amount of writing to exposit in reasonable detail. 

What is really happening in this whole broadly specified process is a two-branched 
operation. In one branch, one abstracts what one envisages as being relevant variables 
into mathematical entities, variables of one sort or another, one abstracts properties of 
these real-world variables, and one develops consequences of this mathematical struc¬ 
ture that one has abstracted. Then one examines the real world, and one has mea¬ 
surement processes: one decides, or merely hopes, that the result of a measurement 



4.2. REPRESENTATION OF LINEAR MODELS 


73 


process on the real world，say X, can be regarded as a correlate of an element x in 
the associated mathematical abstracted structure. In other words, one makes epistemic 
correlations of real world observables to entities in the associated mathematical struc¬ 
ture. In general, the whole process is very complicated. Just to take a very simple case ， 
consider temperature, as a measurable attribute of a specimen, and temperature as it 
appears in some mathematically formulated theory. The process of developing modes 
of observation and drawing on measurement protocols is itself a result, nearly always ， 
of some interplay of formal (or perhaps, very informal) theory and observation. 


4.1.2 Comparative and Absolute Experiments 

We are concerned with the area of design and analysis of experiments, and more specifi¬ 
cally, with comparative experiments. The use of the restricting adjective “comparative” 
is very natural in that we are concerned with entities called treatments which can be 
applied to experimental units, for example, children, cows, plots of lawn, and pieces of 
steel, and we are concerned with determining differences between treatments with re¬ 
spect to outcome or response variables, which we shall, often, call yields; for example, 
with children under different treatments beginning at the age of 6, we are interested in 
height and weight at age 8, the latter being yields or outcomes or response variables. 

In contrast to the comparative experiment we have the absolute experiment. In this 
case we have observations on a presumed constant (or set of constants) and our task is 
to determine it; for example, the charge on an electron, or the life curve of a species or 
race of mice. 

The most widely used approach to the comparative experiment, and to a large ex¬ 
tent, to the absolute experiment, is the use of linear models to which we now turn. 


4.2 REPRESENTATION OF LINEAR 
MODELS 


We suppose that we have a variable y to be explained in terms of variables xi ) X 2 >... 

We have units or entities, such as, human subjects, plots of land, and pieces of steel, 
on which we have observed each of the variables y ， 工 1 ，巧， … ， x p . Suppose we have 
observed n units. We may then represent our data set as an n x 1 vector y, y '= 
( 2 / 1 , y 2 ? …， Un) which is to be explained by means of the columns of what we call the 
model matrix 

(xn 
X21 

X=. 

\*^nl 




^2p 


= (x 1 ,x 2 ,...,x p ), 


where xi, X 2 ；.... x p are vectors. A linear model is given by the equation 

y = X/3 = /?iXi + 々 2 x 2 + ... + （ 4.1) 

in which the coefficients fh ， 02 ” •， ， are either given or are to be determined, and 
there are no relationships among the coefficients H . •., j3 p . We use = to be a 
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shorthand for “approximately described by.” In contrast to a linear model, we might 
have occasion, for instance, to consider the model 

y = ,5lXj + 0'iX 2 +03X3, 

which is nonlinear in the parameters (3\ and j3s. Equation 4.1 will be used to encompass 
two types of model: 

(a) An approximative model in which we wish to represent a given vector y by a 
linear form in xi, x 2 .... ； x p , so that the problem is then strictly one of defining 
a distance between two n x 1 vectors, say y and z and then obtaining that z 
which is a linear combination of xi ， X2, , x p which is nearest to y, a problem 
that clearly lies in approximation theory, perhaps elementary. 

(b) A stochastic linear model in which y is a random vector and 

y = X/3 + e ， 

where X/3 is some fixed vector, to be estimated in one way or another and e is a 
random vector following some distribution. 

43 FUNCTIONAL AND CLASSIFICATORY 
LINEAR MODELS 

4.3.1 Functional Models 

If we measure, say humans of age 21， with y being weight, x\ height, being adult 
height of the male parent and if we wish to explain or fit y by means of a model on x\ 
and ^ 2 , the values of the explanatory variables can take a continuum of values. It is 
useful to give such models the name functional models. 

A general problem that arises in observational studies in which one merely observes 
the explanatory variables and these are continuous variables is when the explanatory 
vectors Xi, X 2 ,x p are nearly linearly related by one or more relations of the sort 

7 iXi + 72X2 + … + 7 pXp = 0, 

where 71 , 72 , • • • ,7p are constants and 0 is the n x 1 vector of zeros. This is called the 
problem of multicollinearity. It leads to considerable difficulties (for example, Myers, 
1990). This problem is of great concern when the explanatory variables are numerous 
and related in some way not necessarily known or even partially understood as in many 
economic studies, for example. 

4.3.2 Classificatory Models 

In contrast to the previous case, the individuals with y-values which we wish to fit by 
the model may be classified according to factors of classification: for example, with 
an experiment in blocks and treatments (see Chapter 9) the observational units may be 
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classified by the block classificatory factor and the treatment classificatory factor. In 
such cases a classificatory linear model is considered of the type 

y = block effect + treatment effect 

or more conveniently for many purposes 

y = block effect + treatment effect, 

where is some (appropriately defined) constant. 

Formal representation of this as a linear model is achieved by the following type of 
language. Let the units be indexed by u = 1, 2.... ,n. Let x(u ， z) = 1 if unit u is in 
block i and let z(u.j) = 1 if unit u has treatment j, these variables being otherwise 
equal to zero. Then a linear classificatory model in simple scalar form is 

y u = x(u, l)/3i + x(u, 2)/3 2 十 ..• + x(u, b)pt 

+ z(% 1)7*1 + Z(U, 2)7*2 H - h z(u, t) 丁 t 

b t 

= J2x(u,i)0.i + '^z{u,j)r j , 
i=l j = l 

where Pi (i = 1 ， 2,... 6) represents the effect of the zth block and Tj (j = 1, 2,..., t) 
the effect of the jth treatment. In matrix form this can be written as 

y = + X r T, 

where and X r are n x 6 and n xt matrices, respectively, of known constants and 
(3 = (H …， 0b) f ， 丁 = ( ti , T2 , ... ,r t y are vectors of unknown parameters. One 
may wish to include a constant term and would then have 

y = 3 n fi + Xd/3 + X r r ? (4.2) 

where J n is the n x 1 vector of unities. 

The significant aspect of models of the form (4.2) is that every element of X， the 
model matrix, is equal to 0 or 1. Then, additionally, because every unit is in one and 
only one of the blocks and receives one and only one treatment, we have relation¬ 
ships like 

XgJb = 0 n , X r 3 t = ^ n . 

From one point of view we have in the model multicollinearity of a very simple type re¬ 
sulting from the fact that the model contains contributions, combining additively, from 
subsets of the data resulting from imposition of classifications into disjoint subsets. 

4.3.3 Models with Classificatory and Functional 
Components 

A more general class of linear models has both functional and classificatory portions. 
A very simple example occurs with a block-treatment classification if an additional 
“continuous” variable has been obtained: so, for instance, we might have 


Vij 士 M + A + 勹 + ， {x i：h 
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where observations are indexed by ij, {/JJ are block effects, {rj} are treatment effects, 
and Xij is an observation on the ijth individual, such as initial weight in a growth 
feeding experiment. Continuous variables that are adjoined to a classificatory linear 
model as potential explanatory variables are given, for some quite obscure reason, the 
name concomitant variables, or covariates (see Section 4.13 and Chapter 8). 


4.4 THE FITTING OF y = X/3 

In formal terms the problem of fitting a model of the form (4.1) may be represented 
as follows: determine (3 and X/9 such that the badness of fit of y by X/3, denoted 
by BF(y, X^), is minimized. We shall not give a general discussion of this general 
problem. Instead we shall take, for our purposes, 

X/3) = (y — X/3) / (y - X/3), (4.3) 

and we refer to this as least squares fitting. We shall suppose at this point that (3 may 
be any vector in R p , that is, the components of (3' = (/?i ， 卢 2 , ..., f3 p ) may be any real 
numbers. 

4.4.1 The Notion of Identiflability 

Before proceeding with this, we ask a simple question: Suppose one is actually given 
the vector X/3 with knowledge of X but not of /3. Can one determine a linear function 

A ; /3 = Ai/?i + ^202 + ... + Ap/3p 

for given values Ai ， A 2 ；..., A p ? As background for the question, suppose we are given 
that, with 0 i = [i ； 82 = a-i, 0 ^ = a 2 , 

— 5 

"+ = ‘7 

can we determine a\l This question can be thought about in very simple terms or in 
nonelementary but nonadvanced terms. Here is a geometric way of doing so. 

We suppose X/3 = X/3 0 , that is, there is a vector /3 0 which gives the vector X/3. Is 
then 乂 (3 necessarily determined to be 入 ’/3 0 ? A slightly sophisticated way of thinking 
about this is to note the following: The equation X(/3 - f3 0 ) = 0 is equivalent to the 
vector 0 - /3 0 being perpendicular to every vector that is the transpose of a row of the 
matrix X. With 

/v[\ 

V2 

X = (X]_，X2，. . •，Xp) ^ . 

Wn/ 

we say: The row space of X is the set of all vectors arbitrary real 

numbers, and we denote this by i? = i?(X). Also A ; (/3 — f3 0 ) = 0 says that /3 — /3 0 is 
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perpendicular to the vector 

入 2 

A =., 

which we write as (/3 — 0 0 ) 丄入 . Then we can see that 

(/3-/3 0 ) 丄 i?(X) 


must imply 

(0 - /3。） 丄入 • 

This happens if and only if A G i?(X) or X f = a’X for some vector a, or 入 =X a. 

In the case of an approximative linear model, y = X/3, we say X /3 is identifiable 
if A 7 = a’X for some vector a. 


4.4.2 The Notion of Estimability 

In the case of a stochastic linear model 

y 二 X/3 + e 

with £ ； (e) = 0, where E(.) denotes the expectation or expected value, we say that 入 ’/3 
is linearly estimable if there exists a vector a such that 


E( a 'y) = V/3 

or because 

^(a / y) = ^[a / (X/3 + e)] 

= E[si f Xf3 -h a 7 e] 

=a ， X/3 + E(8i f e) 

= a ; X/3 

we say, given that /3 is a completely free vector, that f3 is estimable if there exists a 
vector a such that 

a，X = Y 


or, again, A G R(X). 

4.4.3 The Method of Least Squares 

Now we proceed to the least squares fitting and give a sequence of results. 

1. By differentiation of (4.3) with respect to the unknowns, /?i, /? 2 ,…， /? P ，we get 
the normal equations (NE) 

X’Xb = X’y, (4.4) 

where we use b to denote the variable in the equations. 
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2. These equations are consistent for all y € i? n ; that is, all vectors y’ = (yi, y 2 , ... ， Vn) 
where each yi can be any real number. Hence solution vectors b exist for any 
such vector y. 

3. Whatever solution we take, the vector Xb is unique given a particular vector y. 

This is so because for two solutions to (4.4), say bi and b 2 , we have X’Xb!= 
X 7 Xb 2 which implies X / X(bi — b 2 ) = 0 and hence 

(bi - b 2 ) / X , X(bi - b 2 ) 二 0 
or 

(Xbi - Xb 2 ) / (Xbi - Xb 2 ) = 0, 


from which it follows that Xbi — Xb 2 = 0. 

4. No matter what solution vector b we take, BF(y' Xb) is the minimum value of 
(y-X/3y(y-X/3). 

5. Because Xb is unique, the NE necessarily give a unique answer for a given y 
for any identifiable or estimable function A'/3. In fact , 入 then is such that there 
exists a solution to the conjugate normal equation: X’X/9 = 入 ， and the fit for 

is 

6. There is，from (2) above, a vector such that 

X，Xbi = 


where 

e；- = (0 • • • 0 1 0 … 0) (i = l ， 2 ， ... ， n). 

T 

ith position 

So there is a matrix B = (bi , b2 , ••• ， b n ) such that 

X^B = X^e! e 2 ...e„) =X ， I„ = X ，， 
and XB is unique. 


7. From (4.5) we have 

BUB = B'X 1 

and transposing yields 

BUB = XB. 


(4.5) 


So BX is symmetric and idempotent (s.i.p.). This matrix XB is determined 
solely by the matrix X and we write 

Px = XB (4.6) 


with 

= P x = (4.7) 

these encompassing the symmetric idempotent properties. Furthermore 


P x X = XBX = BXX = X. 


(4.8) 
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8 . Premultiplying 

— X，Xb = X ; y 

by B ; we get 

BTXb = B，XV 
or, using (4.6), (4.7), and (4.8), 


Xb = P x y- 


9. We write 

y = Pxy + (I - Px)y 

=a x +a 0 , (4.9) 

where ax represents the fit of y by Xb and a 0 represents the residual. Then 
a&ax = y’(I - Pxj'Pxy = 0 . 

So Pxy and (I — Px)y are n x 1 vectors that are perpendicular and 

y'y = (Pxy)'(Pxy) + [(I - Px)y]'[(l - Px)y] 

= y’Pxy + y’(i —Px)y. (4.10) 

10. This gives the very simple analysis of variance: 

Explanatory Source Degrees of Freedom Sum of Squares 

X rank(X) y’Pxy 

Residual n — rank(X) y ; (I — Px)y 

Total n y f y 

11. We attach a number, the degrees of freedom (d.f,) associated with the explanatory 
source, X, equal to rank(X), the rank of X, which is equal to the row rank of X 
or the column rank of X or the determinant rank of X. Also 

rank(X) = rank(Px) 

because 

Px = XB. sorank(Px) ^ rank(X) 

and 

X = PxX, so rank(X) ^ rank(Px)- 

A rationale for the number of d.f. is as follows: The fit for y is Xb for some 
b. Writing Xb = 61 X 1 + 62 X 2 + • • • + 6 p x p , we say that Xb is in the column 
space of X, denoted by C(X). Now C(X) is a space of dimension r = rank(X). 
Similarly, the residual is (I — Px)y which is restricted to (7(1 — Px) which is 
a space of dimension rank(I — Px) which is equal to n — r, because X’[(I _ 
P x )y] 二 0 for every y. So the elements of (I — Px)y are n linear forms which 
are restricted to be null in r ways. 
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12. We must emphasize that the matrix Px which is utterly intrinsic in the math¬ 
ematics of the least squares approximation is not at all essential to the actual 
numerics of least squares fitting. We are given the NE: X 7 Xb = X'y and we 
have to find a solution, it does not matter which because Xb is invariant. We 
shall describe below how one may adjoin conditions on solutions in the form of 
Cb = c (which we may take to be 0), so as to get a definite solution to the NE. 

13. If b is a solution vector, we have 

y’Pxy = (y ; )(Pxy) = y f Xb = b f X f y. (4.11) 

The well-established phrase associated with this is: 

The sum of squares removed by the fitting of y by X/3 is equal to the sum of 
products of a solution of the NE and the right-hand sides of the NE, that is，the 
inner product of a solution vector and the right-hand side of the NE. 

14. It is worth noting that rank(X) is equal to the dimensionality of C(X), that is with 
X = (xi, X 2 *.. •, Xp) so that Xi is the zth column of X, the set of all vectors 

: ai a real number| = C(X). 

Given the vectors xi ， X 2 .... : x p , we can find a maximal set of linearly indepen¬ 
dent vectors 

{^1 5 ^2，•••，（，} 

such that 

r 

= 0 implies = 0 (z = 1. 2,.... r) 

i=l 

and every vector is given by 

r 

為. 

j=l 

The number of vectors in such a maximal set is the column rank of X. 

We have seen that Px = XB, P x y = X(By); so Px takes a vector y into a 
vector that is in C(X). Further, 

(y - Pxy)’Pxz = y’(I - Px)Pxz = 0 

so Px projects y orthogonally onto C(X). 

15. It is also clear that there exist maximal sets of independent row vectors in X. 
One merely “works down” the row vectors keeping in a list those that are linearly 
independent of previous ones on that list. Necessarily，any such maximal set has 
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r members, where r = rank(X). This tells us that there exists a set of linear 
functions of (3, .... A 7 r /3, such that with 


(3 = M(3 


it is the case that 

Xf3 = ZA f f3 = ze, 

where Z is n x r of rank r. Also, if /V/3 is identifiable or estimable then 

V/3 = v'e 


(e{\ 


/Ai\ 

e 2 

= 

•^2 



\Kj 


for some u. In other words: a linear model y = X/3 of rank r on a parameter 
that is a p-vector (p ^ r, of course) can be written as a model y = Z0 where Z is 
n x r of rank r, and 0 is a set of r linearly independent identifiable or estimable 
functions. This process is called reparametrization to full rank. Clearly, it can 
be done in many ways. We shall see that some ways are more natural or more 
convenient than others. 


4.4.4 Theory of Linear Equations 

We have seen that to obtain the fit for any identifiable or estimable function we merely 
have to obtain any solution of the NE: X 7 Xb = X ; y and then if b* is any such 
solution, the solution for Xb is Xb* and the solution for any identifiable function 乂 (3 
is A’b' The question is, therefore, to exhibit ways of getting one solution. 


1. We now give a few basic ideas on the theory of equations. A necessary and 
sufficient condition for the equations Ax = d with unknown vector x to be 
consistent can be expressed equivalently as 

(a) d G (7(A) or 

(b) rank(A|d) = rank(A) or 

(c) i/A = 0’ implies u'd = 0. 

2. Notions of generalized inverses of matrices are useful. Consider the equation 
with a given real matrix A in an unknown matrix X, 

AXA = A. (4.12) 


It is solvable because of the following: 

(i) From basic matrix theory, there exist invertible P and Q such that 


PAQ = 
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or 

A = P_1 S) Q ' (4 . 13) 

where r = rank(A); and 0 are null-matrices of appropriate dimensions. 

(ii) Take such P and Q. Then, take 

i) p - (4 . i4) 

where J, K, L are arbitrary but of appropriate dimensions. It is easy to 
verify that AXA = A and hence X of (4.14) is a solution and any solution 
is necessarily representable in this way. 

(iii) Suppose A is of rank r and the submatrix consisting of rows ot\, 0 , 2 ,.. 

a r and columns .... /3 r is A and is invertible. Then we can obtain 

an X satisfying AXA = A by making up X from A -1 by inserting this 
in rows H . …爲 and columns ai, a 2 ,..., a r and inserting zeros 
everywhere else (see Example 4.1). 

Any solution X of AXA = A is called a generalized inverse (or g-inverse 
for short) of A and is usually denoted by A 一 ， even though it is not unique. 

3. Let A— be a particular generalized inverse of A. Then 

(i) the equation Ax = d is consistent if AA _ d = d, 

(ii) any solution to a consistent equation is of the form 

x = A~d + (I-A~A)z (4.15) 

for some z. 

4. Hence, if we have to solve the NE, X’Xb = X’y，we can find an (X’X) _ by 
the procedure in (2) above and then take as solution 

b = (X'XrX’y. (4.16) 

It is the case necessarily that 

X(X / X)~X / = Px 

because with X’XB = X’ and Px 二 XB = = P^- we have 

X(X / X)^X / = B / X / X(X / X)~X / XB = BXXB = P^P X = Px 

5. Another process for obtaining a solution to the consistent equation Ax 二 d is 
merely to adjoin consistent equations Cx = c so that the augmented equations 
have a unique solution. In our case, then, given the NE ， X'Xb 二 we adjoin 
conditions on solutions: Cb = c. It is the case that 
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(i) If the equations in b are to be consistent, then, from 1(c) above, we must 

have that z^X’X + u f 2 C — 0 / implies z^X’y 十 v f 2 c = 0. If we must have 
consistency for any conforming y, we must have that z^X’X + v f 2 C = 0 ; 
implies u f 2 c = 0 and = 0 / which implies i^X’X = O', — 0. 

So, of course, Cb = c must be consistent in b. 

(ii) This condition on C is, in slightly sophisticated terms, the condition 

R{C) n R{X f X) = {0} 


or 


because 


i?(X) n R{C) = { 0 } 

B ， X，X = X 


implies 


which with 


i?(X) C i?(X 7 X), 
i?(X ， X) C i?(X), 


gives 


R{X , X) = R(X). 

Hence the prescription is clear: We adjoin to the NE the equations Cb = c, 
which are consistent (which we can accomplish merely by taking c = 0) 
and which are such that the only identifiable or estimable function z/C/3 is 
the null function and such that rank(C) = p-r, the column rank deficiency 
of X. We shall give examples of this process later. 


The method of obtaining a 沒 -inverse as described in 2.(iii) above is used often in 
statistical software to obtain a solution to the NE (4.4). We shall return to this point in 
Section 6.11, but give a simple example here to illustrate the procedure. 


Example 4,1: For the model (4.2) with 6 = t = 2 we obtain X’X in (4.4) as 


/4 

2 

X'X - 2 
2 

V 2 


2 2 2 2 \ 

2 0 11 
0 2 11 = A say 

112 0 
110 2 / 


and, obviously, rank(A) = 3. Then 
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with rank(A) = 3 and 


We then obtain 



hr = (x ， x)- 


/ .75 -.5 

-.5 1 

0 0 

-.5 0 

\ 0 0 


0 -.5 0\ 

0 0 0 

0 0 0 

0 1 0 

0 0 . 0 / 


□ 


4.5 MOORE-PENROSE GENERALIZED 
INVERSE 

We know from (4.5), (4.6) and (4.7) that there exists a U such that 

A'AU = M, P A = AU = = Pa. 

Similarly, there exists a V such that 

AA，V = A, A V = P A , = P: = Pa- 

Now consider the matrix A + = V^AU，which is unique because AU and A ; V are 
both unique. Clearly, this matrix A+ is determined uniquely by A. It is an interesting 
matrix because 

(i) AA+A = A. 

(ii) A 十 AA+ = A+. 

(iii) AA+ is symmetric and idempotent. 

(iy) A + A is also symmetric and idempotent. 

The properties (i)—(iv) above can be verified easily by making use repeatedly of 
(4.8) and the properties of Pa and Pa / above: 

(i) AA+A = AV AUA 二 AUA 二 A. 

(ii) A+AA+ = V AUAV AU = V'AVAU = V，AU = A+. 

(iii) (AA+y 二 （ AVAU)，= (AU)，= AU 

(iv) (A+A)’ = (V’AUA)’ = (V’A)’ = V’A, 
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Furthermore A* is the unique solution to the equations in X 

AXA = A 
XAX = X 
(XA)' = XA 
(AX)，= AX. 

This matrix is called the Moore-Penrose generalized inverse of A, or, now briefly, the 
M-P inverse of A. The M-P inverse has the following basic properties: 

⑴ (A0 + = (A+)'. 

(ii) The shortest solution, that is, x such that x’x is minimized, of the consistent 
equation 

Ax 二 d 


(iii) (A ， A)+ = A+(A + y 

This gives us directly, 

(iv) The shortest solution of the NE: X ; Xb = X’y is 

b = X+y. (4.17) 


(y) The vector perpendicular to the hyperplane set {/3 ： C/3 = c}isC 十 c ‘ 
We see, of course, that A + is a particular generalized inverse of A. 


4.6 CONDITIONED LINEAR MODEL 

4.6.1 Affine Linear Model 

Consider the model y = X/3, with/3 not free in R p but restricted by consistent equality 
conditions, C/3 = c, but otherwise free. Clearly we can write for any particular choice 
ofC~, 

(3 = C~c + (I - C _ C )7 for some 7 
or 

(3 = C+c + (I — C + C )7 for some 7 . 

So this restricted model is transformable to 

y = XC _ c + X(I- CTC )7 
or 

y = XC+c + X(I - C + C) 7 . (4.18) 
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Ainv=ginv(A); print Ainv; 

MOORE-PENROSE INVERSE 
AINV 

0.0625 0.03125 0.03125 0.03125 0.03125 

0.03125 0.265625 - 0.234375 0.015625 0.015625 

0.03125 - 0.234375 0.265625 0.015625 0.015625 

0.03125 0.015625 0.015625 0.265625 - 0.234375 

0.03125 0.015625 0.015625 - 0.234375 0.265625 


Either of these is appropriately called an affine linear model. 

The fitting of this is really quite routine because with y* = y — XC+c the NE for 
7 in (4.18) is 

[X(I - C+C)]’[X(I - C + C)]7 = [X(I - C+C)] V 
with shortest solution [see (4.17)] 

7 =[X(I-C + C)] + [y-XC + c], 

Hence, as one may verify, the shortest solution to the least squares fitting of the whole 
problem is 

b = C+c + [X(I — C + C)] + [y- XC+c]. (4.19) 

This mode of procedure is of considerable interest with regard to the mathematical 
structure that is being investigated. It does require, however, determination of M-P 
inverses and these are not at all easy to find, in general. Computer programs to do this 
exist, of course, such as SAS/IML (SAS Institute, Inc., 2002-2003). 

The following example serves as an illustration. 


Table 4.1 Moore-Penrose Inverse 



Example 4.2: Using the matrix A from Example 4.1 with the GINV function in 
SAS/IML yields, together with the input statement, the Moore-Penrose inverse given 
in Table 4.1. □ 
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4.6.2 Normal Equations for the Conditioned Model 

A quite different method of tackling the problem is to use Lagrange multipliers and 
differentiation. This gives the following equations, which we define to be the NE asso¬ 
ciated with y = X/3, C/3 = c, 

(? c 0 )UH x : y ), (调 

in which the vector m is a vector of undetermined multipliers. We now state basic 
properties of this NE (4.20): 

(i) It is consistent for all y and all c such that C/3 = c is consistent. 

(ii) The vector X/3 is invariant over all solutions. 

(iii) Any part solution b gives a minimum of (y — X/3)’ (y — X/3) subject to Cf3 — c. 

(iv) The minimum sum of squares is 

y r y — b’X’y — b’C’m. 


Interesting and relevant properties of matrices involved are 


⑴ rank 


fx ; x a 


X 、 


{ c 0 J=rank^ c j+rank(C). 


(ii) Suppose a generalized inverse of the coefficient matrix is given by 


fx，x c ’、 十 —/D E\ 

V C 0J = \F GJ , 

then a solution of (4.20) is 

b = DXV + Ec 
m = FX’y + Gc 

and 

Xb = XDXV + XEc. 

Furthermore XDX r is symmetric idempotent and is, in fact, equal to 


[see (4.19)]. 


X[X(I - c+c)] 
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4.6.3 Different I^pes of Conditions 

An interesting point arises if C/3 is not identifiable or estimable. In that case u[X 4- 
i/ f 2 C = 0 ; implies = O’ ， 4C = O’. It follows then from (4.20), which can be 
written in part as (Xb - y)’X + m’C = 0 ’， that m’C = 0’ and the minimum sum of 
squares is the same as that which would be obtained with just y = X/3, that is, without 
the restriction, C/3 = c. Also, we may note that if R(C) fl iZ(X) = {0}, that is, no 
nontrivial i/C/3 is identifiable or estimable, and rank C = p — r, the rank deficiency 
of X, then 

d 

is invertible and implies 

(X’X + C f C)b = XV + C’c (4.21) 


with solution 

b = (X'X + C'Cr^X'y + C'c). (4.22) 

A particularly important special case of the preceding occurs when C = A’X so 
that for every it is the case that v'C/3 is identifiable or estimable. In that case, in 
(4.20), 

X，Xb + C，m = X'y 


gives 


X，Xb + X， Am = X ; y 


or 

Xb = P x (y-Am). (4.23) 

Then 

c = Cb = A，Xb 二 A / Px(y - Am) 

so that 

A P x Am = A’P x y - c = A’P x (y — X/3 0 ) (4.24) 

for /3 0 such that C(3 0 = c. Then (4.24) gives a unique solution for PxAm, which can 
be written as 


PxAm = P x A(A / P x A)-A / P x (y- X/3 0 ) ~ Q(y - X/3 0 ), 
which, substituted into (4.23), gives 

Xb - Pxy - Q(y - X/3 0 ). 

The minimum sum of squares is 

(y- Xb) / (y-Xb) = y , (I- Px)y + (y - X/3 0 ) , Q(y - X/3 0 ). (4.25) 

This is the sum of squares of residuals plus 

(y- X^ o yP x A(A'P^A)-A'P^(y - X(3 0 ). 
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But 

AP x y = A'Xb = Cb. A ， P x X/3 0 = = c, 

where is the fit for A ; /3 in the model y = X/3, that is, the one without the 
restriction, and the additional sum of squares of deviation is then 

(Cb - c) / (A / P x A)-(Cb-c). 

Here, we may, of course, also use (A’PxA) + and if, as would normally be the case, 
C = A’X is of full row rank, then A^xA is of full rank. Note that we can also write 
A’P X A = A / X(X / X)~X / A = C(X / X)-C / . 

It is clear from the above that if we have the model: y = X/3, Ci^ = ci, C 2/3 = 
C 2 , with i?(Ci) C R(X), and i?(C 2 ) A i?(X) = { 0 }, then the space of possible fit 
vectors {Xb: b E R p } is not restricted by the conditions C 2 b = C 2 . In other words, 
given bi such that Cibi = Ci, C 2 bi = C 2 , we can find b 2 such that Cib] = Ci ， 
but C 2 b 2 = 62 + C 2 . Obviously, then the restriction C 2 b = C 2 has no impact 
on the goodness of the representation of y by a vector Xb. More formally, with the 
model specified in the preceding, we have, with Ci = A^X, that the LS fit for X/3 is, 
according to (4.23 )， 

Xb = Px[y - Aim] 

with m satisfying (4.24), and it will be the case that 

{Xb: Cib = Ci, C 2 b = C 2 } = {Xb ： Cib = ci, C 2 b = 62,62 7 ^ C 2 }. 

4.6.4 General Case 

We can now describe very succinctly the situation with general conditions on param¬ 
eters, C/3 = c. Necessarily, there exists with C of dimensions q x p, a partitioned 
matrix 

T 二 

of order q x q and invertible, such that 

i?(TiC) C R{X) 
i?(T 2 c)ni?(x) = { 0 } 

and the conditions C/3 = c have impact on the fitting (whatever “reasonable” criterion 
of badness of fit is used) only with respect to the “portion ”： TiC/3 = Tic. Further¬ 
more, the conditions T 2 C /3 = T 2 C have no impact on the resulting fit of by Xb 
for some vector b. 

A small final point merits a little consideration. How do we find Ti and T 2 ? A 
possible procedure, not necessarily optimal with regard to computing, is as follows: 
Obtain first a basis for i?(X). Let the matrix of this be X. Then consider 
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Reduce this to row echelon form. Then we shall obtain a matrix C as the lower nonnull 
part of the row echelon form. Then C/3 = c, C/3 — c, and C/3 is not identifiable or 
estimable and the conditions C/3 = c are not restrictive at all. The conditions C/3 = c 
are equivalent then to C/3 = c and TiC/3 = Tic. We are not giving here any attention 
to the question of how best to perform the operations that are involved. 


4.7 TWO-PART LINEAR MODEL 

4.7.1 Ordered Linear Models 

Suppose we wish to consider explaining or approximating a variable vector y by means 
of a linear model on explanatory variables -^ x p , and Zi ， 之 2 , ..., 之 Then we 

will wish to ask: 

Do the variables 之 i, 之 2 , •. • ，之 g help in explaining y after we have used xi, X 2 ,.. 
x p l 
or 

Do the variables Xi ， X 2 ： • •• - x p help in explaining y after we have used zi ， Z 2 ,..., z q l 
We may address these questions with matrix language by considering two ordered two- 
part linear models: 

y-XA + X 2 /3 2 (4.26) 

and 

y = X 2 f3 2 + X 1 f3 v (4.27) 

Here the order of writing is strongly relevant. To address this, taking the model (4.26) 
we can contemplate fitting y by Xi/3 1? for which we are then looking at the residual 
(I — Pxjy [see (4.9)]. Then, multiplying (4,26) by (I — PxJ, our model would give 

(I-P Xl )y = (I-P Xl )X 2 /3 2 . (4.28) 

If now we fit (4.28) by the method of least squares we get the NE 

X^(I- P Xl )X 2 b 2 = X^(I- P Xl )y. (4.29) 

Rather naturally, we call (4.29) the reduced normal equation, RNE, associated with 
X2/3 2 cifter Xi/^. Since (4.29) is a normal equation [that is, of the type Z’Z0 = 
Z’y], it follows that (I — Pxi)X 2 b 2 is uniquely determined. We achieve then the 
representation, based on (4.28), 

y 士 Px!y + (I — PxJX 2 b 2 . (4.30) 

In the same way, we have the RNE for Xt/3^ after X2/3 2 , using model (4.27), which is 

Xi(I - PxJX^! = X;(I - P X2 )y. (4.31) 

The overall minimum sum of squares is then obtained from (4.30) as 

[(I - P Xl )y - (I - P Xl )X 2 b 2 ]'[(l - P Xl )y - (I - P Xl )X 2 b 2 ] 

- y'(i - Pxjy- 2/(1 - P Xl )(i - P Xl )x 2 b 2 + b ㈤ (I - P Xl )X 2 b 2 
- y ; (l - Pxjy- b ; 2 x' 2 (i - P Xl )y. (4.32) 
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We note that in (4.32) y’(I — Pxjy is the minimum sum of squares for fitting the 
model y = Hence (4.32) says that the additional sum of squares removed is 

the inner product of a solution vector of the RNE (4.29) and the right-hand vector of 
the RNE. 


4.7.2 Using Orthogonal Projections 

We shall now develop the situation by the use of P-matrices, projection matrices. We 
use s.i.p. as an abbreviation for symmetric idempotent matrix. 

We have the following: 


⑴ There exists Bi such that X^XiBi = and P Xl = XiBj is s.i.p. with 
Px^i = Xi and Px x is the orthogonal projector onto C(Xi), the column 
space of Xi. 


(ii) There exists B 2 such that X’ 2 X 2 B 2 = X’ 2 and Px 2 = X 2 B 2 is s.i.p. with 
Px 2 X 2 = X 2 and Px 2 is the orthogonal projector onto C(X 2 ). 

(iii) There exist Bi,B 2 such that 




or 


and 


(g: 勸 (I) = ⑵ 仰 ) 


P Xl x 2 = (Xi:X 2 ) 


(I! 




X1B1 4 - X2B2 


is s.i.p. with P Xl x 2 (Xi ： X2) = (Xi ： X 2 ) and P Xl x 2 is the orthogonal projec¬ 
tor onto C(Xi ： X 2 ). 


For brevity of writing, we now use 



Pxi = Pi, Px 2 = P 2 . PX 1 X 2 = P 12 • 


Then 

I = Pi + (Pi2-Pi) + (I-Pi 2 ) 


and 

y = Piy + (P 12 - Pi)y + (I - Pi 2 )y 



三 ai + a 2 .i + ao -12 (by definition). 

(4.34) 
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Table 4.2 (X 2 |Xi) — ANOVA for the Ordered 
Model y = Xi^ + X 2 /3 2 


Explanatory Source 

d.f. 

Sum of Squares 

Xi/3 x 

^1 

y，Piy 

X2/9 2 after 'X.i/3 1 

r 12 - ri 

y’(Pi 2 - Pi)y 

Residual 

n - r 12 

y , (i-Pi2)y 

Total 

n 

y’y 


Then we have decomposed the vector y additively into three orthogonal vectors ai ， a 2 .i ， 
and ao. 12 . We believe the reader will recognize immediately the rationale for the nam¬ 
ing of these that we use. A little thought gives 

P12P1 = Pi, so P1P12 = Pi by transposition, 

Pi(I - P 12 ) = 0, 

(Pl2-PlY = (Pl2-Pl), 

(Pl 2 -Pl ) 2 = (Pl 2 -Pl), 

(I-p 12 ) 2 = (i-p 12 ). 

So Pi, P 12 - Pi and I - Pi are each s.i.p. and we then have from (4.34) 

y'y = aia ： + a;. 1 a2.t'+ a^). 12 a 0 . 12 

=y’Piy + y'(Pi 2 - Pi)y + y , (I - Pi 2 )y. 

This is nothing but the ANOVA corresponding to the ordered linear model: y = 
Xi/3 x + X2/3 2 which is normally presented as in Table 4.2 where r\ = rank Xi, 

V\2 — rank(Xi ： X2). 

We see that the degrees of freedom associated with y’(Pi 2 — Pi)y are the degrees 
of freedom associated with (P 12 — Pi)y, which is rank(Pi 2 — Pi) = trace (Pi 2 — Pi) 
(because P 12 — Pi is s.i.p.) = trace P 12 — trace P 1 = rank(Pi 2 ) — rank(Pi)= 

rank(Xi ： X 2 ) — rank(Xi). 

We can abbreviate our naming of sources, clearly, to the following: 

Xi 

X 2 |Xi 

I|X!X 2 

a usage we shall find very handy. 

Clearly, rather than fitting 'X ， i/3 1 first and then X2/3 2 to the residuals, we can fit 
X2/3 2 first and then Xi/3 a to the residuals. This corresponds to the identity 

I = P 2 + (P 12 -P 2 ) + (I-P 12 ) 
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Table 4.3 (Xi |X 2 )—ANOVA for the 


Ordered Model y = X2/3 2 + ^iPi 


Explanatory Source 

d.f. 

Sum of Squares 

X 2 


y’P 2 y 

Xi|X 2 

r \2 - r 2 

y’(P 12 - P 2 )y 

I|X 2 X! 

n - r l2 

y ; (i-Pi 2 )y 

Total 

n 

y’y 


and 

y — a 2 + 3-1.2 + ao. 12 . 

We then get from this decomposition the ANOVA of Table 4.3. 

We note that the residual sum of squares of Table 4.2 and 4.3, y’(I — Pi 2 )y, is the 
same as (4.32). 

How do we use these ANOVAs? We suggest that this is rather obvious. If we need 
X2/3 2 after Xi^ x , then it is the case that 


55(X 2 |X!) 

55(I|XiX 2 ) 

is “large.” Just what we mean here by “large” will be clarified later (see Section 4.17). 


4.7.3 Orthogonal ANOVA 

A small question is “When are the two ANOVAs the same apart from naming?” This 
happens only if 

y'(Pi 2 — Pi)y = y'Poy, for all y 
and 

y’(P 12 — P 2 )y = y/Piy，for all y 
or 

Pl 2 = Pi + P 2 

or 

XiP 12 X 2 = XiPiX 2 + X / 1 P 2 X 2 
or, using the properties of P 12 , Pi ， P 2 ， 

xix 2 = xix 2 + x;x 2 


or 


X;X 2 = 0. 
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Conversely, if X^X 2 = 0, it is clear from (4.33) that 

Pl2 = Pi + P2 

and we have just one ANOVA (see also Section 4.11). This is referred to as orthogonal 
ANOVA. 

Little addenda to the above are as follows: 

(a) P 12 - Pi is the orthogonal projector onto C[(I - Pi)X 2 ], and 

(b) P 12 — P 2 is the orthogonal projector onto C[(I - P 2 )Xi] 

4.8 SPECIAL CASE OF A PARTITIONED 
MODEL 

Consider the model: y = 0^ + X^, where D f = (1,1,..., 1) with n components. 
Then we see that 

P 3 = -33 , . (4.35) 

n 

The RNE for X.f3 is 



We obtain (4.36) merely by performing least squares fitting on y with the mean y 
subtracted from each y and with the mean of each column of X subtracted from the 
elements of that column. 

The sum of squares for the explanatory source Ofi is y^^y, which is nothing but 
点 y’33’y, or ^ [square of total of y], a quantity commonly called the correction factor 
in ANOVAs “around the mean •” 

4.9 THREE-PART MODELS 

Here we consider 

y = Xi/3i + X2/3 2 -f X3/33 

with six ordered three-part linear models, one for each of the six orderings of Xi, X 2 , 
and X 3 . We have thus six ANOVAs, which we represent in Table 4.4. To explain 
what the associated sums of squares are we use Pi, P2, P3. P12 ： P13, P23 and P123, 
extending in an obvious way the results of Section 4.7, and merely give an example: 

55(X 2 |X 1 X 3 ) = y / (Pi 23 -Pi 3 )y. 

It is rather easy to see that the matrix of any sum of squares is s.i.p. The third ANOVA 
of Table 4.4 corresponds to the identity 

I = P2 + (Pl2 - P2) + (Pl23 - Pl2) + (I — P123 )， 
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Table 4.4 The 6 ANOVAs with an Unordered Three-Part Model 




Explanatory Sources 



X x 

Xi 

x 2 

X 2 

X 3 

X 3 

X 2 jXx 

X 3 |X! 

Xi|X 2 

X 3 |X 2 

Xi|X 3 

X 2 |X 3 

x 3 !X!X 2 

X 2 |X!X 3 

X 3 |X!X 2 

X 1 |X 2 X 3 

X2|X!X 3 

XJIX2X3 

I|X x X 2 X 3 

I|X!X 2 X 3 

IIX^oX.3 

I|X 1 X 2 X 3 

I\x 1 x 2 x 3 

I|X 1 X 2 X 3 


giving 


y = a 2 + a l-2 + a 3.12 + a 0.123. 


that is, the vector y has been decomposed additively into four vectors that are orthog¬ 
onal. 


4.10 TWO-WAY CLASSIFICATION 
WITHOUT INTERACTION 

We have given earlier [see (4.2)] the data model structure 

y = 3/j,- {- X r r + X c c, (4.37) 

where X r and X c are incidence matrices with X r 3 = 3, X c 3 = 3, the 3-vectors hav¬ 
ing appropriate dimensions. This represents a special case of a three-part model, but a 
very important one in the context of comparative experiments (see Chapters 9 and II. 1). 
In this case we are interested, as we shall see, only in two ANOVAs. We are interested 
in the extent to which incorporation of X r r and/or X c c improve approximation of y 
by 3/x. We then have only two relevant ANOVAs: 

J 0 

X r |3 X c \3 

X c |3X r X r |^X c 

I|3X r X c I|3X r X c . 

A very natural question is whether we shall have 

SS(X c |3X r ) = SS(X C |3), 

which then induces and is equivalent to 

SS(X r |3X c ) 二 SS{X r \D). 

It is clear from Section 4.7 that if this is to happen we must have 


/(P rc - P r )y= y ， (P c -P 3 )y ， 
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where is the orthogonal projector onto C(3), P r is the orthogonal projector onto 

C(X r ) = C(3:X r ), P c is the orthogonal projector onto C(X r :X c ) = C(3:X r :X c ). 
The equality must hold for all y so we need 


This gives 


or 


or, using (4.35), 


P rc — P r = P C — 

Ki^rc - Pr)X c = X ； (P C - P 3 )X C 
0 = X / r X c -X ； P a X c 


X ； x c = -(X ； 3)(3 , X C ). 
n 


(4.38) 


Since the elements of X;X C are the numbers of observations for each row-column 
combination, (4.38) tells us that we have one ANOVA if the frequencies are propor¬ 
tional. Contrariwise，if X^X C = X’ r P 3 X c ，then with P rc = X r B r + X C B C , where 


， x;x r x' r x c 、O f x， A 


、 x ， c x r X^X C/ 




(4.39) 


with Xj,X r B r = X ; r , X^X C B C = X’ c , P r = X r B r , P c = X C B C , we obtain from 
(4.39), premultiplying by B 7 r and B: ， respectively, 


X r B r + P r PgX c 6 c = P r 
P c P0X r B r + X C B C = P c . 


(4.40) 

(4.41) 


But 


PrP ： J= P：)，PcPa = 

so, adding (4.40) and (4.41 )， 

Prc + = P r + P c . 

Since P rc P 3 = P 3 , this implies 

Prc + P^= Pr + Pc 


or 

Prc — Pc = Pr — Pa 

so that we have just one ANOVA. Hence, proportional frequencies are a necessary and 
sufficient condition for both ANOVAs to be identical. 
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4.11 K-PART LINEAR MODEL 


4.11.1 The General Model and Its Sums of Squares 

Consider the linear model 


y = Xi/ 3 i + X2P2 + … + 

Clearly, there are kl associated ordered /opart linear models. Taking the original order, 
we have 

P1 * P12 ； Pl23，. • • ， Pl2.../c 
and 


I = Pi + (P 12 - Pi) + … + (Pl2 … fc — P 12 … 口 ） + (I - P12 … fc )， 


giving 

and 


y = a l + a 2.1 + • • . + a 0.12 … fc 


y’y = a；ai + + … + ^^. 12 ^a fc . 12< ^ + aQ. 12 ^a 0 .i 2 ...fc. 

This may be presented as an ANOVA: 


Explanatory Source 


Xi 

X 2 |Xx 

X 3 |X 1 X 2 


X^|XiX 2 ... X ^-1 
I|XiX 2 … Xfc 

I 


in which degrees of freedom and sums of squares may be written down at sight. Every 
sum of squares is obtainable by considering for some Zi, Z 2 

y = Zi7 X 
y = Zi7 2 4 - Z 2 72 


with respective sums of squares 

7fZ iy , 


where Z^Zi^ 
where 


=Ziy, and 


^Z^y + ^Z^y, 

(Z 1 ； Z 2 ) , (Z 1 ； Z 2 ) ⑵ =(Z 1 ： Z 2 ) , y 


(4.42) 


(4.43) 
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and then taking (4.43) minus (4.42). For example, to obtain SS(X 3 |XiX 2 ) we take 

Zi = ⑵ 

and 

Z 2 = X 3 , 7 2 =/3 3 . 

The difference (4.43) minus (4.42) can then be expressed as 
SS(Z 2 |Z 1 ) = SS(Z 1 Z 2 )-SS(Z 1 ) 

or, alternatively, as 

SS(X3|X!X 2 ) = SS(X!X 2 X 3 ) - SS(X!X 2 ). 

This shows that the sums of squares are obtained by fitting sequentially “larger” mod¬ 
els, that is, 

y = XA 

then 

y = Xi/3i + X 2 f3 2 

and then 

y = Xi/ 3 i + X 2 / 3 2 + X 3/33 

etc., and obtain SS(Xi), SS(XiX 2 ), SS(XiX 2 Xs) etc” respectively, and then 
SS(X 2 |X!) = SS(X!X 2 ) - SS(Xi) 
and 

SS(X 3 |XiX 2 ) = SS(XiX 2 X 3 ) - SS(XiX 2 ) 

etc. It is for this reason that the sums of squares in the table above are often referred to 
as sequential sums of squares in the context of the /c-part linear model. We emphasize 
again, that the use of P-matrices is particularly valuable for mathematical purposes. 

Among the k\ orderings for the fc-part linear model we can identify k orderings 
such that for the zth ordering Xif3 { occurs in the last position (i = 1,2,... ? k). Then 
the last sum of squares of the sequential sums of squares is 

SS(Xi|all other X^) = SS(XiX 2 ...X fe ) - SS(all except X^) 

for i = 1, 2,..., fc. These k sums of squares are referred to as partial sums of squares. 
These play an important role for nonorthogonal models, that is, models for which con¬ 
ditions corresponding to those given in Section 4.7.3 do not hold, for testing hypotheses 
involving (see, for instance, Sections 8.3.5, 8.8, 9.8, 9.10, and 13.4). 

For later reference we mention here that for the Statistical Analysis System (SAS) 
package (SAS Institute, Inc. 2002-2003) the sequential sums of squares correspond to 
the Type I sums of squares and the partial sums of squares correspond to the Type III 
sums of squares. 
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4.11.2 The Means Model 

It is relevant to give a little detail of alternative modes of specification of models. 

The simple case which we have denoted by y 士 十 X ly 3 (see Section 4.8) where 
Xi is a n x p binary (0, 1) matrix, can be written as y = X17, because D = 3 n = 
Xi3 p , so 7 = /3 + Opfi. When written in the latter mode, the rank of the model equals 
rank (Xi), and there are no dependences that the model y = 3/U + Xi/3 contains. This 
model without dependences is called a cell means model (e.g., Hocking 1985, 2003) or 
means model for short, which we shall write as y = X/i. 

In the case of the two-way classification without interaction, we have written the 
model as y = 3^ + Xi/3 x + X 2 ^ 2 . We use this because it leads, automatically and 
directly, to the two relevant and interesting ANOVAs. Alternatively, we may write this 
in the form of a means model as 



Vijk ~ 

= Mij 

(4.44) 

with conditions on the model as 




Mu 

—Pi. — A 


(4.45) 

for all i, j, or, as 

f^ij - 

fMf — 

V j H - = 0 

(4.46) 


for all i, i f , j, f. 

We have given earlier, modes of presentation of a conditional linear model. To 
write either of the alternative models in matrix form and then to construct the nor¬ 
mal equations is an unpleasant chore. Additionally, to go from the conditional means 
model, say 


y = Xp 
C/i = 0 


with C of the form appropriate to (4.45) or (4.46) to consideration of what linear forms 
in f3 x and in /3 2 are identifiable is awkward. 

We want, of course, to write models for higher classification data structures. Con¬ 
sider a three-way structure with observations indicated by i = level of factor 1, j = 
level of factor 2, fc = level of factor 3, and l = level of observation within levels i, j, k 
of factors 1, 2, and 3. We prefer the model written as 

y = Xi/3 x 4 - X 2 / 3 2 + X 3 冷 3 + Xi 2 / 3 12 + Xi 3 / 3 13 + X.2302S + X 123 戸 123 , 

in which we include the possibility of interactions between factors 1 ， 2, and 3. How are 
we to represent the case of no triple interaction? The only alternative way is to write 
the means model as 

Vijki = IMjk (4.47) 

and then adjoin conditions, such as 

f^ijk — fMjk，— fMfk. — + = 0 (4.48) 
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for all i, i\ j, j\ fc, . To write the model with no interactions we have to write (4.47) 
and (4.48) with, also, 

f^ijk ~ ~ fHj’k + = 0 ， for all i. j. j , k, k 

IMjk — (M’jk 一 f-^ijk' = 0, for all i. i , j. k* k 

and 

fMjk — fH’jk — "ij’A ： + = 0, for all i, i f j. j , k. 

To write this as 

with conditions on fi, given by Cfi = 0 is exceedingly awkward and space filling. 

We suggest, by way of this brief discussion, that using means models is, in general, 
not good data analysis procedure. Such models do not, without additional notation and 
commentary, reflect the structure of the data and hence are not suggestive concerning 
tests of hypothess or estimation of linear functions of interest to the researcher. For ex¬ 
ample, model (4.44) can just as well represent a data structure where factor 2 is nested 
within factor 1. Similarly, model (4.47) could represent, among several possibilities, a 
data structure where factor 3 is nested in factor 2, and factors 1 and 2 are crossed. Such 
ambiguity can lead easily to inappropriate inference and hence to erroneous conclu¬ 
sions and explanations of the data (see also Section 4.12.7), although the proponents of 
the means model argue contrariwise (Hocking and Speed, 1975). 

Models of the form we prefer are said to be overparameterized models. We discuss 
such models in more detail in Section 4.12.7. We shall show that for balanced classifi- 
catory data structures the perceived disadvantage of overparameterized models can be 
dealt with very easily. It is only for certain types of unbalanced data structures that the 
means model seems preferable (see Section 4.13.2). 


4.12 BALANCED CLASSIFICATORY 
STRUCTURES AND ANALYSIS OF 
VARIANCE 

In the previous sections we have considered linear models from a rather general point of 
view without distinguishing between what are usually referred to as regression models 
and classificatory or classification models. The reason, of course, is that the math¬ 
ematics associated with these different models is really the same. In the context of 
comparative experiments, however, classificatory models play a major role. It is useful 
then to digress briefly from the general discussion and introduce and illustrate some 
important concepts for classificatory models. More specifically we want to discuss 
certain data structures, how such structures lead to classificatory linear models, and 
associated analyses of variance. 
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4.12.1 Factors ， Levels，and Partitions 

The analysis of variance (ANOVA) was first developed for what we call balanced clas- 
sificatory structures. We have a set of individuals which can be partitioned or classified 
by one or more factors of classification. So human individuals can be classified by 
country of birth, by sex, by religion of ancestors, and so on. A factor of classification 
thus gives a partition of the set of individuals into disjoint subclasses. We denote a 
factor of classification by a capital letter, say, A, and we call the subsets produced by 
the factor the levels of the factor. So with any one factor every individual in the set 
possesses a level of each factor. Obviously, the set can be partitioned into subclasses 
by whatever number of factors is given by the data. So with two factors, which we de¬ 
note by A and B, every individual in the whole set possesses a level of each of the two 
factors. There is a critically important attribute of the relationship between two factors 
or two subsets of the totality of factors. Clearly, a combination of factors is itself a 
factor because, for example, we can partition the set of individuals by two factors, A 
and B, and every individual is at a particular level of factor A and a particular level 
of factor B, so every individual is at a particular level of the joint factor AB. So then 
with four factors A, B,C, and D, we have four single factors, A, B,C, and D and we 
have six factors involving two factors, AB, AC, AD y BC, BD, and CD ，and we have four 
factors involving three factors, BCD ， ACD, ABD, and ABC, with one factor involving 
all four simple factors which we denote, naturally, by ABCD. We call the constituent 
factors in any joint factor by the name of the single factors. 

4.12.2 Nested ， Crossed，and Confounded Factors 

Now consider two factors. Let these be sex, 5, and continent of birth, C, Then there 
are individuals in every subclass formed by the joint factor, SC. In contrast, let the two 
factors be county of residence and state of residence which we denote by C and S" 
respectively. For exposition we assume that the states and counties of the USA have 
different names, which is not “quite true.” Then there is only one county called Story 
County and one state called Iowa, and Story County is in the State of Iowa. It is 
obvious then that all individuals in Story County, having the same level of the factor 
county, have the same level of state, namely Iowa. In this case we say that the factor C 
is nested by the factor 5, or, equivalently, that the factor S nests the factor C. Clearly, 
in this situation, we cannot have county Story in the state of Virginia so the combination 
(Story, Virginia) is not a possible combination. 

It is clear that in this case, the joint factor CS gives the same partition as the factor 
C. The same type of relationship can be considered with joint factors, say AB and CD. 
So we can say, for example, that the joint factor AB nests the factor C, or that the joint 
factor AB nests the joint factor CD. And so on. 

If two factors A and B or two joint factors, say, AB and CD are such that one does 
not nest the other, they are said to be crossed. 

There is a third type of relationship between two factors. Suppose we have t EUs 
and t treatments, and that we assign each of the treatments to one unit. Then it is 
clear that the factors, units and treatments, are not crossed and that one is not nested 
by the other. We say that units and treatments are completely confounded. We say this 
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because whatever difference we observe between the units receiving, say, treatment 1 
and treatment 2 can be explained by the difference caused by the difference between the 
treatments, or by the difference between the units on which treatment 1 and treatment 
2 fell. 

4.12.3 The Notion of Balance 

We now turn to the matter of balance. We have a data set consisting of a set of indi¬ 
viduals which may be humans, mice, pieces of engineering equipment, for example, 
which can be classified by a factor A. If the number of individuals at each level of A is 
the same for all levels, we say that the data set is balanced with respect to factor A. If 
we have two factors A and B, we say that the data set is balanced with respect to (joint) 
factor AB if the number of individuals at each level of AB is the same for all possible 
levels of A and B. 

With data in a classificatory factorial structure, it is natural to index individuals by 
the levels of the several factors which the individual possesses. So with only one factor, 
A, we may index individuals by i(j), or simply by ij without causing any confusion, 
in which i denotes the level of the factor A of the individual and j indexes individuals 
within the subclass of individuals having level i of A. We may note that i(j) indexes 
subclasses which each consists of one individual. 

4.12.4 Balanced One-Way Classification 

We have just one factor of classification, A, and we index the levels of this factor 
by i = 1, 2,… ，儿 Our use of the letter A for two purposes should not cause any 
confusion. To use different letters is easy but very tedious. We suppose that there are r 
individuals in each subclass, we denote the totality of observations by 

{yij: i — 1,2..... A. j = 1,2,..., r}. 

Now we note that we can form an average within each i class, which we denote by yi, 
and an overall average which we denote by y... Then, obviously, we have the identity 

Vij = y.. + {Vi. -y..) + {yij - Vi.)- 

Then we can form 

+ - y..)' 2 + 'Y^yi 3 - Vi.) 2 plus sums of cross products. 

ij ij ij ij 

But the sums of cross products give zero: 

= 5 .工 E(H) = 0 

ij j i 

V-XVij ~ Vi.) = V-- 人 Uij — 5i.) = ◦ 

ij i j 

^{yi. - y.XVij ~Vi.) = - V-) - Vi-) = °- 

ij i j 
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Table 4.5 ANOVA for One-Way Classification 


with Equal Numbers 


Source 

d.f. 

SS 

CF 

A 

Residual 

1 

A-l 

A(r — 1) 

rAy 2 

- y ..) 2 
~ yi .) 2 

Total 

rA 



We have further 

= r 部 2 = CF ， 

where CF is referred to as the correction factor, and 

_ y -) 2 = r Y^i. -v -) 2 = ss (^) 

ij i 

- Vi.) 2 = SS (Units within 4) 

ij 

thus giving the ANOVA of Table 4.5. The d.f. can be obtained as follows: 

(i) CF is the square of one linear function so CF has 1 degree of freedom. 

(ii) rE{(yi t — y,) 2 is the sum of squares of A linear functions \/r(yi. — y..)^ — 

1 ， ... ， A，which are connected by one linear relation — y.) — 0. So 

this sum of squares is said to have (A - 1) degrees of freedom. 

(iii) Finally, / 勿 —— y^.) 2 is the sum of squares of Ar linear functions that are 
connected by A linear relations, Ej (yij — yi) = 0, i = 1 ， …， A and hence is 
said to have Ar - A = A{r - 1) degrees of freedom. 

Usually the term CF is subtracted from Total without renaming so that 

new Total = ^ yfj — rAy 2 with d.f. = rA - 1. 

4.12.5 Two-Way Classification with Equal Numbers 

We have a data set classified by rows (R) and columns (C) with n (彡 1) observations 
within each row-column cell. The observations are indexed by i = 1 ， ， .. ， i?; j = 
1, ■,. ， C; and = 1,..., n, that is, yijk. We can now have the identity: 

Vijk — y... + (yi.. — y...) + {y.j. ~ y...) 

+ iVij. ~ Vi.. ~ y.j. + y...) + (l/ijk — Vij .)， 


(4.49) 
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in which 


y … 


RCn 


^ ^ Vijk ， Vi.. 


ijk 


Cn 


Vijk 

jk 


and so on. 

Then just as in the simple case in Section 4.12.4 we obtain by squaring both sides 
of (4.49) and summing over all indices 


y 2 nk = + 5Z(n.) 2 + - 钇 ..) 2 

ijk ijk ijk ijk 

+ 知 . _ yi- •— 勾 .j. + 5...) 2 + — Vij.) 2 - ( 4 . 50 ) 

ijk ijk 

The cross product terms in the square of the sum yield zero; for example, 

^2iVi.. - y...){yij. - Vi., -y.j. + y.J 

ijk 

= 〉:〉 '人 Vi.. ~ y...) 〉 '人 Vij. — Vi. •- V,j. + 5 ...) = 0 

k i j 


because by the “equal numbers” - Vi.. - y.j. + y...) = 0. 

The constituent terms on the RHS of (4.50) are in order: CF, SS(Rows) ， SS(Cols )， 
SS(RowsxCols), and SS(within cells). We thus have the ANOVA as given in Table 4,6. 


4.12.6 Experimental versus Observational Studies 

We shall digress here briefly from our general development of classificatory structures 
and elucidate how these structures can occur in different contexts. We shall demon¬ 
strate later (Chapter 9) that this has certain consequences regarding inference obtained 
from the models of such structures. To keep the discussion focused on the essential 
idea we confine ourselves here to the two-way classification and use the following ex¬ 
amples. 

Example 4.3: An experiment was conducted to determine the effects of three differ¬ 
ent fungicides (Ai, A 2 , As) on the yield of fruit from four different cultivars of apple 
trees (Si, B^, B 4 ). In an orchard six trees from each cultivar were available for 
the experiment. Each fungicide was applied, by random assignment, to two trees from 
each cultivar. Yields of fruit were obtained at the end of the growing season. n 


Example 4.4: An experiment was conducted to determine the effect of ozonization 
at three reaction times (Ai f A 2 , ^ 3 ) and four pH levels {B\, B 2 , Bs, B^) on effluent 
decline. Each combination Bj)(i =1, 2, 3; j =1, 2, 3, 4) was applied to two 
samples of effluent. □ 
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Table 4.6 ANOVA for Two-Way Classification with 
Equal Numbers 


Source 

d.f. 

SS 


CF 

1 

RCny 2 


Rows 

R-l 

CnE 版 . 

— y ...) 2 

Columns 

C-l 

RnT,j(y.j, 

-y ...) 2 

Rows x Columns 

(R-l)(C-l) 

71 Sy {yij. ■ 

-yi.. - y.j. +y...) 2 

Within Cells 

RC(n - 1) 

^ijk \ Vijk 

-Vij.) 2 

Total 

RCn 




EXAMPLE 4.5: A study was conducted to investigate possible differences in milk 
butter fat for cows in three age groups (Ai, A 2 , As) from four breeds (Bi, B 2 , 5 3 , 
5 4 ). For each combination d Bj)(i =1,2,3; j =1 ， 2, 3, 4) two cows were randomly 
selected and their butterfat percentages determined. □ 

The common feature of these three examples is that the observations y“k (f = 1 ， 2, 
3,; j = 1, 2, 3, 4; fc = 1 ， 2) are typically displayed in a two-way table as given below: 



Bi 

Bo 

b 3 

b 4 

Ax 

X, X 

X, X 

X. X 

X, X 

^2 

X, X 

X, X 

X, X 

X, X 

义 3 

X, X 

X, X 

X, X 

X, X 


where the x's represent the two observations for each cell, that is, each combination 
(Ai, Bj). Furthermore, each yijk can be expressed in terms of a two-way classification 
model as 

Vijk = f-l bj (db) ij 

or, mimicking (4.49), 

Vijk = cti bj -€-ij fc (4.51) 

or in terms of a cell means model as 

Vijk ~ f^ij 

or 

Vijk = ^ijk ? (4.52) 

where ai represents the effect of Ai(i = 1, 2, 3), bj represents the effect of Bj(j = 1, 2, 
3,4), and (ab)ij represents the interaction between Ai and Bj (expressing, for example, 
differences among the effects of factor A depending on the levels of factor B). 
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The reader will have noticed that although there appears to be only one linear 
model, the situations leading to this model are quite different. More specifically, Ex¬ 
ample 4.3 represents a generalized randomized block design (Section 9.7), where the 
fungicides represent the levels of the treatment factor, and the cultivars represent the 
levels of the blocking (or intrinsic) factor. Example 4.4 describes a completely ran¬ 
domized design (Chapter 6) with a factorial treatment structure (Section 11.2), where 
reaction time and pH level represent the treatment factors. Thus Examples 4.3 and 4.4 
describe intervention or experimental studies. On the other hand, Example 4.5 repre¬ 
sents an observational study, and only data from such a study should be referred to, 
strictly speaking, as a ’’two-way classification”. 

One may ask why we draw this distinction. The reason will become clear when 
we discuss the individual experimental designs and how statistical inferences can be 
drawn from such experiments (for Example 4.5 we do this in Section 4.17). We shall 
show that the distinction arises because of different properties for the error term in 
model (4.51) as a consequence of the randomization of the treatments to experimental 
units. For Example 4.3 this will lead to an asymmetry for the treatment effects, a^, 
and the block effects, 6』， a crucial distinction arising only in experimental but not 
in observational studies. This is another reason why we discourage the use of the cell 
means model like (4.52) for experimental studies since such a model implies symmetry 
of the various factors. 

4.12.7 General Classificatory Structure 

To develop a complete picture of the nesting and hence of the crossing relationship, the 
following is useful. Let the units or a set of individuals be indexed by a variable u. Let 
-A be a factor. Then individual u lies in some level of A. Denote that level by xa{u). 
Take a subset 5 consisting of k factors. Then the joint level of u with respect to S will 
be a /c-vector, S(u). Individuals u and u f will be at the same level of 5 if S(u) = S(u f ). 
Let T be another subset of m factors. Then the level of u with respect to T will be an 
m- vector, T(u). If whenever S(u) = 5(w ’)， it is the case that T{u) = T(u / ), we say 
that factor subset S is nested by factor subset T or equivalently that T nests S. It is 
useful also to define what will be meant by the product of two sets of factors S and 
T. We define ST to be the set of factors that contains the factors in S and/or in T. 
So that if 5 = ADE and T = ABF, ST — ABDEF. Let S and T be two sets of 
factors with associated partitions. Then S and T have the same associated partitions if 
S(u) = 5(u ; ) implies T(u) = T{u f ) and T(u) = T 、 v!) implies S{u) = 5(^). 

Suppose there are factors A, B and C. Then the totality of formal possible product 
factors are A, B, AB, C, AC, BC, and ABC. These will be different generalized factors 
only if there are no nestings. Suppose B is nested by A. Then B gives the same 
partition as does AB, and BC gives the same partitioning as ABC. The totality of distinct 
partitionings or factors is then 

A, AB,C, AC, and ABC. 

It is useful to name a partition by that name with the most letters. 

This leads to a useful indexing of levels of factors. If A nests B and if we index 
the levels of A by z, then we index the levels of B by ij. Then we index units in AB 
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subclasses by ij, the first index giving the level of A and the second index j indexing 
levels of B within A levels. And similarly for generalized factors. 

These ideas lead to the concept of admissible means. A mean is admissible if it 
is a mean of a subclass resulting from a particular partition. If, for instance, A nests 
B and B nests C with observations indexed by i = level of A, ij = level of B and 
ijk = level of C, then the admissible means are VUij. an d Vijk- If we have two 
factors A and B that are crossed, with factor C nested by AB combinations, and levels 
indexed by i, j, k, respectively, the admissible means are 勾…， 沒 《 .. ， y.j., yij. and y^. 

We can transfer the idea of nesting of factors to the idea of nesting of subscripts. 
So, for example, if A nests B and B nests C and we index by ijk, we say that i nests j, 
and j nests k. 

A useful idea given by Zyskind (1962) is that of the rightmost bracket of a set 
or a subset of subscripts. Suppose the full set of subscripts is i\i 2 ,i' Then 
an admissible mean is defined as one in which whenever a nested index occurs, then 
all the subscripts that nest it must also appear. Let j 1 , j 2 ,..., be a subset of the 
subscripts. Then the group of subscripts (in that subset) that nest no other subscripts is 
said to be the rightmost bracket of the subset. It is convenient to enclose the rightmost 
bracket of a subset of subscripts in parentheses. If, for example, we have a structure in 
which A nests B and C is crossed with A and B, with subscripts i, j, fc, respectively, 
the admissible means are denoted by Vi.k ，and 邮 fc). Alternatively, 

we say that a subscript belongs to the rightmost bracket of a group of subscripts if no 
subscript of the group is nested in it. 

We can now give the ANOVA of a balanced data structure, which is one such that 
when cross labeling is used for every factor, whether crossed or nested, the range of any 
subscript is the same for all possible combinations of values of all the other subscripts. 
Then with ij(uv), for example, we define the component associated with this group of 
subscripts to be 

Vij{uv) ~ Vij(u，）— Vij[.r) + Uij" 

The sum of such components with fixed ij, and summing over u and/or v is zero because 
of the balance in the balanced data structure. The idea is that a component starts with 
an admissible mean and contains admissible means given by averaging over subscripts 
in the rightmost bracket with sign equal to -1 if averaging is over an odd number of 
subscripts and +1 if averaging is over an even number of subscripts. 

For a balanced data structure, we can write an identity such as (see Section 4.12.4) 

队⑴ = 反 . + ( 仏 . 一 A.) + (Vi(j) — Vi.) 

and clearly & ( 免 .— 没 “ ）二 0, U〆 ％ ⑴—仄 •） = 0 for each i. Also we have a sort of 
orthogonality in that we see that E i ： ? _ ( 仿 •— y..){yi(j) - Vi.) is zero by summing first 
on j. 

Hence 

y h) = + _ 反 .) 2 + - Si') 2 ' 

ij ij ij ij 
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This expresses the total sum of squares of the data in the data structure as 

Total sum of squares = correction factor 

+ sum of squares between classes 
+ sum of squares within classes. 

We have a further property exemplified by the following: 

ij ij ij 

YMu) - d 2 = Y.y%) - YA 

ij ij ij 

Identities for balanced data structures contain components associated with each admis¬ 
sible mean, each component being given by averaging over subscripts in the rightmost 
bracket with the sign chosen as explained above. Each component gives rise to a sum 
of squares by squaring the component and summing over all subscripts for a single 
observation. The totality of those sums of squares then constitutes the ANOVA. As an 
example, consider the structure in which A nests B and C is crossed with A and B. 
Following Zyskind (1962) we express this data structure symbolically as (^4 : B)(C), 
where : denotes the nesting relationship and (.)(•) indicates that factors in different 
parentheses are crossed. Six admissible means, the associated components and sums 
of squares are given in Table 4,7. 


Table 4.7 Admissible Means, Components, and Sums of 
Squares for Model (A : B)(C) 


Means 

Components 

Sums of Squares 

y... 

V … 


Vi.. 

Vi.. ~ y... 

^ijk {Vi.. — 2 /… )2 

仏 u). 

" Vi-. 


y..k 

y..k ~ y... 

^ijk {y..k — 

Vi.k 

Vi.k — Vi.. — y..k + y... 

^ijkiVi.k — Vi.. — V..k + V...) 2 

Vi(jk) 

Vi(jk) - Vi(j). - Vi.k + Vi.. 

^ijk{yi(jk) - Vi(j). - Vi.k + pi..) 2 


Vi{jk) 



We note that the sum of the components is equal to the individual observation. 
Hence 


Vi{jk) — sum of components 


represents an identity which gives rise to an appropriate linear model for the given data 
structure. As a consequence of this representation and the assumed balancedness of the 




4.12. BALANCED CLASSIFICATORY STRUCTURES 


109 


data structure we also have that 

yf ⑽ =sum of sums of squares 

ijk 


as exhibited in Table 4.7. 

To help understand data structures such as the one discussed above and others it 
is useful to use a diagrammatic representation as developed by Throckmorton (1961) 
(see also Kempthorne et al. ， 1961). To illustrate this we show the following examples 
in Figure 4.1: 

(i) A (one-way classification), 

(ii) A : B (two-fold nested classification, that is, B nested in A), 

(iii) (A)(B) (two-way crossed classification), 

(iv) [(-A)(S)]: C (A and B crossed, C nested within AB), 

(v) (A : B){C) (B nested in A and C crossed with A and B). 

In these structure diagrams \x indicates the overall population to be partitioned accord¬ 
ing to the structure and 6 ： indicates the fully indexed individual observation. 

4.12.8 The Well-Formulated Model 

In order to fully understand the concepts of crossed and nested factors (see Section 4,12.6) 
and to understand the impact of these notions on the formulation of appropriate linear 
models it is useful and important to present the idea of a well-formulated model. To do 
this in complete generality would be rather cumbersome, so we shall illustrate this idea 
in terms of examples. 

Suppose we have factors A, B, C, D, E where factors A, B, C are crossed. D 
is nested in ABC combinations, and E is nested in BC combinations. The structure 
diagram is given in Figure 4.2. Then the list of generalized factors is 

A, B, AB, C, AC, BC, ABC, ABCD, BCE. ABCE, ABODE. 

We index combinations by i, j, k, l 9 m, for levels of A, B, C, D, E and n for the 
individual in ABCDE subclasses. The admissible means are then 

V{i) . ， y..(k)... 

y ⑷ . ⑻ … ， y.[jk )."， V(ijk )...： yijk(i).. , > 

y.jk.(m ).， Vijk(lm). 
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Figure 4.1 Structure diagrams. 


For each mean we also give the rightmost bracket (note that some means contain 
two brackets; this is simply done for convenience to retain the order of the subscripts; 
in this case the combined brackets constitute the rightmost bracket). 

This leads to a scalar model of the following form: 

Vijkim = +ai^rbj(ab)ij + c k + {ac) ik -h {bc) jk 

+ (abc) ijk + {abcd)ijki + {bce) jkm 4 - (abce) ijkm + {abcde) ijklm . (4.53) 

Model (4.53) may be called a full model ， with a term for every subset of every 
partition. We have to consider models that arise by elimination of factors and combi¬ 
nations of factors. It maybe that we should not include a term such as (ac)^. A simple 
idea is merely to remove the terms (ac)^. But if we remove the term (ac)ik ， we are 
removing the partition AC. We then note that doing this leaves in the model the term 
(abc)ijk which comes from the partition ABC. However, the partition ABC is nested by 
the partition AC, so if the partition AC is to be not included, then so must the partition 
ABC. We make the definition: 

A model is well-formulated if whenever a term corresponding to a partition tt is 
included, the terms associated with partitions that nest 丌 are included. 

So for instance, with crossed factors A and B, the model 


Vijk — H - Qi (a 心 ) 
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Figure 4.2 Structure diagram for model (4.53). 


is not well-formulated. If, however, factor B is nested by factor A, this model is well- 
formulated. 

We insert that discussion of the type above should make it clear why we think that 
using the means model (see Section 4.11.2) is not appropriate here and in general for 
more complex structures. The numbers of possible models can be substantial and our 
arguments spell out clearly how the various models can be derived. Moreover, in Chap¬ 
ters 9, 10, and 13, we shall show that in the context of intervention studies the factors 
for a given data structure are not always symmetric with respect to statistical inference. 
To be more specific, we show, for example, that even though the observations from a 
randomized complete block design (see Chapter 9) have the structure as given in Fig¬ 
ure 4.1 (iii) the factors A (treatments) and B (blocks) have to be treated differently due 
to the randomization procedure (see also Section 4.12.6). 

The normal equations for any well-formulated model and balanced data structure 
can be written down easily because they take the form 


Model value of RHS = Admissible sum 


for all admissible sums, corresponding to admissible means listed above, for example 
for model (4.53). To obtain an analysis of variance, we have to fit the successive models 
obtained by keeping terms providing that if a term is kept, the terms that nest that term 
must be kept, also. 

Models obtained in this way are not of full rank or, in other words, are over¬ 
parametrized. They can be made of full rank by adjoining appropriate conditions on 
parameters. The obvious choice of conditions is obtained by retaining rightmost brack- 
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ets. In the above somewhat complicated data structure (4.53), they are as follows: 

= o, bj = o, y^cfc = o. 

i j k 

> ' 人 =0, > 二 (ab)ij = 0, 〉 ： (a.c)jfc = 0 r 〉 —.: (ac) 认 = 0, 
i j i k 

= 0,^(6c)jfc = 0 ， 

j k 

〉 ： (a.6c)^ffc — 0 . 〉 ^o.bc)ijk — 0, > ' (dbc)ijk. = 0 ， 
i j k 

^ ^ ( abed)ijki = 0 , = 0 

l m 

〉 : 〔 abce)ij — 0, ^ ^ (q. 6c6 ) m ― 0. ( cl bede)ijkim = 0, 〉 ' 人 abede)ijk'im = 0. 

i m l m 

Note that, as indicated above, the conditions on any parameter involve single summa¬ 
tions over all indices in the rightmost bracket for that term (as given for the correspond¬ 
ing admissible mean). Therefore, it is easy to write out these conditions. We shall not 
pursue this example because to do so would involve a mass of equations which can be 
solved in reasonable form only if the whole data structure is factor balanced and data 
balanced. 


4.13 UNBALANCED DATA STRUCTURES 


We have seen in Section 4.12 that for balanced data structures we can easily derive 
a well-formulated model，obtain the normal equations and the ANOVA table. Much 
of the ease of doing this is a consequence of balancedness, in particular solving the 
NE and then writing out the ANOVA table. In practical situations, in particular for 
observational studies and to a lesser degree for intervention studies, we do encounter, 
however, unbalanced data structures. Such structures can lead to certain problems and, 
in fact, to different approaches for solving the NE and obtaining the ANOVA table. We 
shall not give a general discussion here, but rather consider some simple structures to 
illustrate the basic ideas. 


4.13.1 Two-Fold Nested Classification 

For the two-fold structure, a well-formulated model with natural indexing is 


Vijk — O-i -f- {ob)ij. 


( 4 . 54 ) 



4.13. UNBALANCED DATA STRUCTURES 


113 


Suppose i = 1.... .A.j = 1 ..... Bi, k = 1,..., riij. Then the normal equations are 

n•,i~i = Tij.CLi + 〉 二 几 ij(ab)ij = y••• 

i ij 

Ui.ai = 队 "，i = 1 ， ... ，乂 

3 

riijfi + riijai H- riij(ab)ij = yij ,， i = 1 ， ... ， A, j = 

where in an obvious notation n^. = n.. == = LkVijk ， Ui. ‘ — 

Ij k yij k ， y,'. = Hijk Vijk- These are easy to solve in that we may put 〆 = 0, ^ = 
0(z = 1...., A), and a solution is 

〆 = 0 ， a* = 0, (ab)^ = § 以， 

with sum of squares removed equal to Then we have to fit the model in 

which the (ab)ij terms are deleted and a solution is /i** = 0. a** = yi.Jrii .，with sum 


of squares removed equal to /rii.. Finally we have to fit the model: 

Vijk = ^ with NE solution equal to "*** = y … jn" and sum of squares removed equal 
to y 2 Jn.., the usual correction factor. The ANOVA is then 

Source 

d.f. 

SS 


(a 2 ) after " 

after " ， (a^) 

Residual 

1 

A- 1 

EBi - A 

71.. 一 

y 2 ./n.. 
^iVl./rk. - 

-y 2 ./n.' 

Total 

n.. 




4.13.2 Two-Way Cross-Classification 

We now turn to the two-way data structure. We have two factors A and B such that 
neither nests the other. A well-formulated model is 

y.ijk 二 M + 〜+ ~ + (cib) ij , (4.55) 

the ranges of the subscripts with such a structure being i = 1,... .A y j — 1 ， . … B ， k = 
0,1 ， 2, , riij. We specify that there are n 勿 . data points in cell (ij), where can be 
0. This data structure is very important in the design and analysis of experiments area. 
A very important design is the incomplete block design in which A is the block factor 
and B is the treatment factor, and only some treatment levels occur with any particular 
block level (see Chapter 9). A commonly used model, but not necessarily an appropri¬ 
ate one, is that in which there is no interaction of blocks and treatments, so the terms 
(ab)ij are zero and the ejfect of a change from level i of to level i ! of A does not 
vary with the level j of B at which this change takes place. The absence of interaction 
of this type is critically important in analysis of experiments. 
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It is important for the reader to understand the difference between models (4.54) 
and (4.55) as they are associated with different data structures. For example, both 
models contain the term but they have different meanings. In (4.54) (ab)ij 

denotes the effect of the jth level of factor B nested in the ith level of factor A, whereas 
in (4.55) (ab)ij denotes the interaction between the ith level of factor A and the jth 
level of factor B. 

The first important feature of the two-way classification is the matter of identifi- 
ability. It is useful here to invoke the cell means model, that is，to define to be 
the model value for yijk- The question then arises with a data structure that has some 
arbitrary set of cell occupancies, (n^ ), what aspects of the model values are identi¬ 
fied. As in all cases of linear models, the question is that of linear identifiability. We 
may ask then if any particular linear function of /i, {aj}. {bj}. {(ab)ij} is identified. 
The answer to this question is easily seen by representing a linear function of {〜•} 
as TlijCijfiij where necessarily summation is over cells (ij) which contain at least one 
observation. Consider then a linear function of the parameters in model (4.55): 

dofj. + > : diai + E fjbj + ^2 h .ij( ab )ij- 

i j ij 

Here the summations extend over all 2 , all j, and all ij, respectively. This linear function 
of the parameters is identifiable if there exists a set {cij.ij occupied} such that 

Cijfiij = d 0 fi + ^2 木 A + ^2 /A. + X] hij(ab)ij, (4.56) 
ij i j ij 

where E* denotes summation over occupied cells, and the equation holds identically 
when iiij is replaced by /i + + bj + {ab)ij. Comparing the left-hand side and right- 

hand side of (4.56) we must have 

E 木 \ 、氺 . 

C{j — do, 〉 v Cij = d'i (i = 1 .... ? jV) 

O' j 

Cij = fj (j •二 1 ， …， 丑)， dj = hij for all cells. 

i 

Further, from these conditions it follows that we must have 

> : 屯二 d_Q ， > : fj — do, 〉: hij = di ， > : hij = fj. 
i j j i 

Now，in order for ji to be identifiable it follows from (4.56) that we must have d 0 = 1, 
all di — 0, all fj = 0, and all hij =0. But because Edi = do, with 办 =1, these 
conditions cannot be satisfied. Hence \x is not identifiable. Also, no linear function 
of {a^} is identifiable, no linear function of {bj} is identifiable, and a linear function 
T^ijhij (ab)ij is identifiable if and only if is zero for unoccupied cells and T^hij = 
0, E *hij = 0, and = 0. In fact, the only identifiable functions are = 

J2ij c u(M + ai + bj + (ab)ij), for any {c^}. 
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For model (4.55) the admissible means are y..., y.j.*, Vij.. According to the rule 

given earlier，the NE for this model can be written out easily as follows: 

+ n^ai + ^2 n 'j b j = V.. 

i 3 ij 

几 i.M + 打 i.a.i + ^ Tiijbj + 〉 ' 几 ij (ab)ij = Hi, 

3 j 

^.j M H - 〉二 叫 j 叫 + n.jbj 心 ) f = y.j 

i i 

~\~ Tlijbj H - = Vij. = 1， • . • ， A; = . . • ，石 ) 

Just as for model (4.54) it is easy to solve the NE by putting p = 0, 兩 = 0 (i = 
1,..., A), 6j = 0 (j = 1,.... B). A solution then is 

〆 = 0 ， a* = 0, bj = 0, (ab)h = 勾 ij. 

To obtain the ANOVA table we need to fit successively smaller, that is, reduced models. 
In the context of Section 4.10 model (4.55) is a four-part model. Ordinarily a four-part 
model would have associated with it 4! = 24 ordered four-part models and hence 24 
associated ANOVAs. This, however, is not the case for the type of classificatory model 
we are considering here. This is a consequence of the fact that not all such models are 
well-formulated models. For example, the model 

Vijk = f-i cii (a,b)ij 

is not a well-formulated model for this case or, for that matter, the ordered four-part 
model 

Vijk = fJ^ Cli [cib)ij bj 

is not a well-formulated model. In fact，the only two well-formulated models are 

Vijk = // + + bj + (ab)ij 

and 

Vijk = p bj H - ~h (ab)ij. 

The issues of a well-formulated and overparametrized model point to some diffi¬ 
culties with the derivation of the ANOVA table, or better tables, for the model (4.55) 
with Uij ^ 0 and ny 二 0 for some (i, j). This would require a huge amount of writing. 
We shall point out, however, that it is in this context that the means model mentioned 
in Section 4.11.13 is useful. Indeed, it is for situations like this that the means model 
has received more attention. It allows us to spell out sets of identifiable or estimable 
functions of the fiij which may be of interest in explaining data. Since fiij = 如 , and 
var(/^j) = a 2 /riij for riij > 0, it is easy to test hypotheses about We 

emphasize, however, that it is the fact we are dealing here or in other more complex 
situations of this sort with the full model, that is, the model that includes all interactions 
and hence does not require any side conditions, that leads to this ease. Furthermore, we 
need to distinguish carefully between comparisons (of cell means) for experimental or 
observation studies (see Section 4.12.6). 


(z’= 1 ， … ,t4) 
(j = 1,... ,5) 
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4.13.3 Two-Way Classification without Interaction 

A special case of model (4.55) occurs when there is no interaction. For such a no¬ 
interaction model we can write 


Vijk = f^ij — fJ- ai bj . (4.57) 

Just as with model (4.55) a linear function of the parameters in (4.57) is identifiable if 
there exists a set {cij.ij occupied} such that 

^ + ^2 diCLi + H 恤 ‘ (4.58) 

i 3 

From (4.58) we infer immediately that 

= do, ^2 本 Cij = di ， *C{j — fj 

3 i 



It is then obvious that, just as for model (4.55), // in model (4.57) is not identifiable. It 
is, however, possible now to choose the such that E^c 勿 • = 0, not all di = 0 but, 
of course, = 0, and all fj = 0. Thus linear combinations of the are 

identifiable for different choices of the di ，These choices depend on the structures of 
the occupied cells. We shall illustrate this with some examples. 


Example 4.6: We consider the following 4x4 two-way structure where occupied 
cells, that is, identifiable fiij, are marked by x: 

B 



Then, with C 21 = 1, C 31 = -1 ， all other = 0, a 2 - a 3 is identified. It is then easy to 
see, by simply looking at each column in this fashion, that a\ — as, a\ - as - 
ai-a 2 ,anda 2 ~a 4 are also identified. Obviously, any at — (2 ^ i f : i, i' = 1. 2, 3,4) 

is identifiable. We say that this data set is row-connected. Looking at the rows in the 
same way we see that 61 — & 2 J 1 —石 3 ， and b\ — b 4 , are identified, and hence all 
bj — bj> (j ^ 二 1 ， 2, 3, 4) are identifiable, a property we refer to as column- 

connected. This exemplifies a rather obvious result: if a two-way data set is row- 
connected, it is column-connected and vice versa. □ 


Example 4.7: The simplest example of a connected array is in the diagram: 
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□ 


The point is, simply, that in general, with y = X/3, if sl[/3 and a! 2 f3 are identified, 
then so are cia^/3 + C2^20 for any scalars c\ and C 2 . This is provable in that if = 
a ; 2 = z/^X, then cia.[ 4 - C 2 a^ = {c\v\ + C 2 P;)X. Obviously, the result holds 
only with respect to rows and columns that are represented in the data set. 

Example 4.8: It is useful to give a small, nontrivial example of a two-way data set 
that is not row-connected and equivalently not column-connected: 

B 


A 


We see that columns 1, 2, and 3 are row-connected as are columns 4 and 5, while rows 
1, 2, and 5 are column-connected as are rows 3 and 4. This data set consists of two 
disconnected subsets: rows (1, 2, 5) by columns (1, 2, 3), and rows (3, 4) by columns 
(4,5). □ 

The NE for model (4.57) are easy to write out following the recipe given earlier: 

+ = y.. 

i 3 

n-idi + ^ n-ijbj = (i = 1,..., A) 

j 

+ 〉: "i - n.jbj = y,j {j = 1， . • • ， 



We recognize that for this set of 1 + A + equations, summing equations 2 to (^4 + 1) 
yields equation 1 and so does summing equations (A + 2) to (1 + /I + B). Hence 
the rank of the coefficient matrix is equal to A + 丑 一 1. One way to solve the NE 
then is to use the method described in Section 4.4.4 [see 2(iii)]: Using the fact that 
individual a^s and bj’s are not identifiable, we set a a = 0 and bs = 0 and solve 
the remaining A-\- B — 1 equations in " ， ai(i = 1,.A — 1), bj(j = 1，…， — 1). 
Following arguments described earlier it is then clear how to proceed further and obtain 
the ANOVA table or tables for model (4.57) (see also Section 4.7). 
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4.14 ANALYSIS OF COVARIANCE MODEL 

In previous sections we have discussed the linear model y = X/3 in general terms. We 
then focused attention on classificatory models since they play a major role in modeling 
observations arising from comparative experiments. We now turn to models which 
incorporate elements from both classificatory and regression models. These models, 
too, play an important role in the design and analysis of comparative experiments (see 
Chapter 8). The analysis using such models is generally referred to as analysis of 
covariance. 

4,14.1 The Question of Explaining Data 

We place this in the context of approximative or explanatory or descriptive linear mod¬ 
els. We suppose that we have observations which arise from classificatory data struc¬ 
ture with what are called concomitant variables. We are interested in the value of 
classificatory variables and/or concomitant variables, towards explaining or describing 
the variation in the observation variable. 

We can define the questions that are of interest by first writing the linear model 

y = X/3 + Z 7 , (4.59) 

where X represents the values of classificatory variables and Z represents the concomi¬ 
tant variables. We then partition X into Xi and X 2 , and Z into Zi and Z 2 so that the 
linear model is 

y == Xi^ + X 2 / 3 2 + Zi 7 x + Z 272 . 

Because we are interested in describing or explaining the variation in y, we would 
normally include a term 3/?o so that a model with useful degree of generality is 

y == + XiA + X202 + + Z27 2 * 

Here is one example (see Chapter 9): 

Xi is a block incidence matrix, 

X 2 is a treatment incidence matrix, 

Zi represents the observations on one concomitant scalar variable, 

Z 2 represents the observations on another concomitant variable. 

We are interested, then, in the questions: 

(i) Do Zi, Z 2 help explaining the data? 

(ii) Does Zi help? 

(iii) Does Z 2 help? 

(iv) Do Xi ， X 2 help? 
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(v) Does Xi help? 

(vi) Does X 2 help? 

(vii) Do X 2 , Z 2 help? 

plus other questions given by altering subscripts of Xs and Zs. 

To answer these questions, we have to ask what they mean. What should we mean 
by “HELP ”？ It is obvious that if we wish to describe or explain the variation in y, we 
can insert whatever explanatory variables that come to mind. If, for instance, we are 
trying to describe the variation in a test score over, say, 30 individuals, we might insert 
the variable SN, the social security number of each individual, and then consider a 
polynomial in SN ， (SN) 2 . … ， (5A r ) 29 . We would describe the variation in test score 
perfectly because the residual sum of squares would be zero. 

We use least squares fitting to address these questions. As we have seen with y = 
'X.i/3 1 + X 2 /3 2 (Section 4.7), we form a quantification of what X 2 does by considering 
SS(X 2 |Xi) and SS(I|XiX 2 ). The former measures how much X 2 helps after X! 
is used, and the latter measures the residual variation after using Xi and X 2 . So we 
address the questions by computing the following: 

⑴ SS(Z 1 Z 2 |3 XiX 2 ) 

(ii) SS(Zi|3X 1 X 2 Z 2 ) 

(m) ss(z 2 |a XiXsZx) 

(iv) SS(XiX 2 |3 Z 1 Z 2 ) 

(v) SS(X!|3 X 2 Z 1 Z 2 ) 

(vi) SS(X 2 13X 1 Z 1 Z 2 ) 

(vii) SS(X 2 Z 2 |a XxZj 

We may note in passing that there are 14 possible interesting sums of squares with the 
formulation we are presenting, given by 5(a|rest) where a is one of 

Xi, X2, Zi, Z2, X1X2, X1Z1 ， X1Z2,X2Z1 ， X2Z2, Z1Z2, 

X 1 X 2 Z 1 , X 1 X 2 Z 2 , X 1 Z 1 Z 2 , X 2 Z 1 Z 2 

We compare each of these sum of squares with SS(I|3 X 1 X 2 Z 1 Z 2 ). Just how we can 
compare these will be discussed in Section 4.17, in which we give an exposition of 
tests of significance. There is, in fact, simple arithmetic if the factors represented by 
Xi and X 2 are orthogonal partitions. In that case we have a simple ANOVA that we 
shall describe now. 
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4 . 14.2 Obtaining the ANOVA Table 

The general situation is given by the model (4.59): 

y = X/3 + Z 7 - 

We suppose that fitting X/3 and the ANOVA associated with X/3 can be specified 
easily. We then know that the NE for (4.59) are 

X，Xb + XZg = XV 
ZXb + ZZg = Z'y, 

with solution given by 

Xb-Px(y-Zg) 

zqi-PxlZg^Z^I-PxJy. (4.60) 

For purposes of identifiability we must have that Z’[I 一 Px]Z is of full rank. The sum 
of squares of y resulting from fitting y = X/3 + Z 7 is equal to 

/Pxy + g^Il-PxJy- (4.61) 

Suppose now that in (4.59) we have X = (X!:X 2 ) and, conformably, (3 f = (/3\ ， / 3^). 
And we wish to compare the models 

y = + Z 7 

and 

y 士 XM 1 +Z 7 . (4.62) 

To return to our introductory comments, we wish to assess how much X 2 helps explain¬ 
ing the variability of y. In typical applications X 2 represents a treatment incidence ma¬ 
trix and 0 2 represents the vector of treatment effects, r (see Chapters 8 and 9). What 
is needed then is SS(X 2 |Xi. Z) which can be obtained by using the method explained 
in Section 4.7. To this end we need to determine the sum of squares from fitting model 


(4.62)，and to determine the sum of squares we have the ANOVA of y, a sum of squares 
y’Pxiy，and the RNE for 7 : 

Z / [I-P Xl ]Zg = Z / [I-P Xl ]y. 

(4.63) 

The result then is 


y ， P Xl y + fZ ， [I-P Xl ]y. 

(4.64) 


To obtain SS(X 2 |Xi. Z) we simply take the difference between (4.61) and (4.64). 
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4.14.3 The Case of One Covariate 


In order to illustrate the simple general structure of the arithmetic or algebra of the 
analysis of covariance we consider first the usual structure where Z is an n x 1 vector 
z and 7 is a scalar, 7 . Using the solutions to the RNE for 7 , that is, (4.60) and (4.63), 
given by 

_ z’[I - P x ]y 
9= z1I-Px]z 


and 


respectively, we obtain 


- = z'[I - P Xl ]y 
5 一 z ， [l_P Xl ] z ’ 


SS(X 2 |X 1 ,z)=y , (P x -Px 1 )y + 


[ z '(l - P x )y ] 2 

z’[I _ Px]z 


[z’(I_P Xl )y ] 2 

z[I-P Xl ]z 


(4.65) 


By inspection we see that (4.65) contains different types of sums of squares and sums 
of products in y and z. These can be represented as in Table 4.8(a). For brevity, the 
SS and SP of Table 4.8(a) are renamed in Table 4.8(b). The SS and SP for the model 
y = + Z 7 are obtained by simply amalgamating the X 2 IX 1 and I|XiX 2 lines 

in Table 4.8(b) as given in Table 4.8(c). Then (4.65) can be written as 

SS(X 2 |Xl,Z) = Tyy + Eyy - ^ - ^ ^ . ^ 4 ' 66 ) 


To form an opinion on whether X 2 /S 2 useful after including 'Ki/3 1 + Z 7 we have to 


compare (4.66) with 



s ， 


which is the remainder sum of squares for the model y 土 X/3 + Z 7 . We shall discuss 
how this comparison may be made in Section 4.17 and give more details on this whole 
procedure in connection with specific experimental designs (see Chapters 8 and 9). 


4*14.4 The Case of Several Covariates 

To conclude this section we comment briefly on the case of more than one covariate, say 
m covariates. For this purpose we write Z = (zi, Z 2 ,.... z m ), = ( 71 ， 72 ， . ■. ， 7m )， 
and g’ = (沒 1 ，分 2 ,…, g m ) in (4.59). Then the RNE for 7 as given in (4.60) consists of 
m scalar equations 


/ zi \ 
Z2 

[I — Px](Zl ， Z2, … ， Z.m) 

/ gi\ 

92 

= 

〆 z i \ 
z 2 

[I- Px]y 

VmJ 


\9m / 


\ Z ， m) 
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Table 4.8 Auxiliary ANOVAs 


(a) Sums of Squares and Sums of Products for y = Xi/3 x + X2/9 2 + Z 7 


Source 

SS(y) 

SP(y,z) 

SS(z) 

X x 

y’Pxj 

y'PxjZ 

z’P Xl z 

X 2 |X x 

y’[Px -Pxjy 

y’[P x - P Xl ]z 

Z'[Px - Pxjz 

I|X!X 2 

y’[I —P x ]y 

y'[I_P x ]z 

Z'[I - Px]z 


(b) Symbolic Expressions for SS(y), SP(y, z), and SS(z) 

Xi 

■^yy 


a 22 

X 2 |X! 

Tyy 

Tyz 

Tzz 

I|X!X 2 

Eyy 

E yz 

E zz 

(c) Symbolic Expressions for SS(y), SP(y, z), and SS(z) for y = Xi/3 x 

X! 

Ayy 

^■yz 

■^■ZZ 

I|Xx 

Tyy + Eyy 

7^2 + Eyz 

T zz + E zz 


The ith equation (i = 1, 2, ... ， m) is given by 

XX[I - P x ]Zj+& = Z;[I - Px]y 

3 


We note that all the arithmetic that is involved here can be represented by the ANOVA 
ofy, zi, Z 2 ,..., z m and partition of the sum of products of y and each z^. Furthermore, 
the partition of the sum of products of y and a particular z, say Zj, is given by the 
ANOVA of y + z，with for any source in the ANOVA 


SS(y + Zj ) = SS(y) + SS( Zj ) + 2SP(y, Zj )， 

where SP(y, Zj) is defined by, or given by, this equation. For m = 2, this is illustrated 
in Section 8.7. 

4.15 FROM DATA ANALYSIS TO 
STATISTICAL INFERENCE 

So far we have described some procedures for analyzing data, that is, looking at data. 
In that presentation, we start with the data and we adjoin no assumptions. A standard 
problem is that we wish to make inferences. In the area of the present book, our aim is 
to establish laws of uncertainty with regard to treatment effects. We shall discuss this 
in the chapter on randomization (see Chapter 5). This outlook and procedure stands 
outside the standard ideas of mathematical statistics. It is based, however, on ideas 
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coming out of general mathematical statistics. It is necessary therefore, to describe the 
general approach of that area. 

The approach uses the assumption exemplified in the standardly exposited case, 
that our data xi, is a realization of n random variables Xi ， X 2 ,..., X n 

that are distributed independently according to N(f^, a 2 ) (or whatever). One must ask 
where this assumption comes from. It permeates statistical theory and practice. It is the 
basis of all theory whether frequentist or Bayesian. The frequentist approach in general 
says the data set D, say, is a realization of some random entity that has a distribution 
Fd which depends on a parameter, scalar or vector, 0. The Bayesian approach adjoins 
the assumption that ㊀ is a realization of some random entity that has a distribution, say 
G©, which is fixed or depends on a parameter scalar or vector, ip. And so on, leading 
to what is called hierarchical Bayes process. 

What are the problems? The obvious one is the assumption that observations are 
a realization of independent random variables. This is the initial assumption and it is 
surely of very doubtful status. To exemplify this we consider as a simple example the 
agronomic field experiment. 

Suppose one wishes to compare the yields of two varieties of, say, corn. One finds ， 
by some process that can be described, a piece of agricultural land. One partitions this 
land into plots. We need not mention blocking because that idea is irrelevant to the 
point under discussion. We put variety one on some plots and variety two on other 
plots. We get observations {0\j} and {C^}. It is easy, absurdly easy, to say that an 
appropriate model is 

Oij = Cij 

with {eij} being independent realizations of the mathematical random variable X 
which is distributed as iV(0 ， cr 2 ). With this assumption one can apply the statistical 
tests, etc., the so-called inference procedures of a first course in statistics. But what is 
the justification for this assumption? The field plots are not a random sample of any 
population. 

It may be intuitively reasonable to take the view that the field plots chosen are the 
outcome of some stochastic process. In recent years, ideas of spatial statistics and 
spatial random-processes are being put forward for this experimental situation. The 
problem that immediately arises is that there is a huge number (indeed, an uncountable 
infinity) of such processes, each being described by a sentence. To use these ideas one 
has to make a choice of what assumptions to make. 

The approach followed by essentially every writer is to make use of the simple 
normal stochastic linear model. 


4.16 SIMPLE NORMAL STOCHASTIC 
LINEAR MODEL 

4.16.1 The Notion of Estimability 

We now turn to the basic case of a stochastic linear model: y = X/3 + e, in which X/3 
is a fixed unknown vector in C(X), e and hence y are random vectors in R n . Rather 
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naturally we assume that the expectation of e, E(e), is null, because with E(e) = 綷 ， a 
nonnull vector, we would have 

y = /x + X/3 + [e — 五 (e)] 

or 

y = /i + X/3 + e. 

with E(e) = 0 and then E(y) = [x 4 - X/3 is an unknown vector in the linear variate 
{fi + X/3}. So we consider the stochastic model: y = X/3 + e, with E(e) = 0. We 
then have E(y) = X/3. We suppose that /3 is some unknown fixed vector in R p which 
is not restricted a priori in any way. As discussed briefly in Section 14.4, we say that 
a parametric function, 入 , /3, is estimable if there exists a vector a, such that E(a’y)= 
X’ (3. Obviously, this tells us that 入 / /3 is estimable if and only if there exists an a such 
that X f = a’X，or 入 e i?(X), recalling that i?(X) = {K’ = i/X for some u}. This 
tells us also that X /3 is estimable if and only if X’(X’) _ A 二入 . 

The general idea of estimability is useful in many contexts. Consider, for instance, 
the 2 -part model: y = Xi/3! + X 202 + e , We may ask if a parametric function, 入 ’ 
is estimable. For this to happen we must have that there exists an ai such that 

a^Xi = a^X2 = 0. 

We can write this in various ways, of which the following is informative. Using M-P 
inverses we need 

Xia, = Ax ， 

which implies 

ai = (X^)^! + [I - (X+)X；] 7 , 
for some 7 . So we must have 

X , 2 (X+) / A 1 + X^[I- (X JXih = 0 
or 

x' 2 = X^(X+) , X , l! X^X+yAi = 0, 

a potentially useful statement of what must hold for 入 ’ 丄 /^ to be estimable. 

Rather clearly, we have gone as far as we can go without additional assumptions. 
The next natural step is to assume that the random vector e possesses a variance matrix, 
that is, E(ee f ) 9 an n x n matrix, the variance matrix of y, exists. 

4.16.2 Gauss-Markov Linear Model 

The natural initial assumption to consider is E(ee f ) = a 2 I. We give the following 
definition: 

Definition 4.1. The Gauss-Markov Linear Model (GMLM) is 

y = X/3 + e. E(e) = 0. 五 (ee') = cr 2 I ， 


where X is the known and fixed model matrix and f3 is an unknown p x 1 vector 
parameter that may take any value in R p . □ 



4.16. SIMPLE NORMAL STOCHASTIC LINEAR MODEL 125 

The following basic theorem is one of the most important results in linear model 
theory. 

Theorem 4.1 (The Gauss-Markov Theorem). With the GMLM, the best (minimum vari¬ 
ance) linear unbiased estimator (BLUE) of an estimable 入 ’/3 is given by A b where 

b is any solution of the NE: X’Xb = X’y, 

Proof. We know X f = a’X, 入 ’/3 = a’X^Xb = Pxy, 入 ’b = a’Pxy. Consider 
the linear estimator (a’Px + 占 ’)y for arbitrary S. For unbiasedness we must have 
(a’Px + (5’)Xy3 = 入 ’/3 = a’X/3. With /3 free this tells us that <5’X — 0 or 6’Xb = 0 
or yPx ： = 0. Hence 

var[(a’P x + <5’)y] = cr 2 (a’P x + 5’)(P x a + 6) 

=a 2 (a / P x a + 2&P x a + S f S) 

= cr 2 (a’Pxa + 

This is minimized with respect to S, obviously, by taking (5 = 0. So the result is 
proved. 

The result of Theorem 4.1 implies, of course, that 

E = E[X ， b) = V/3. 

For later purposes it is useful to obtain also an expression for var ( 入 • Using results 
from Sections 4.4.2 and 4.4.4 we find 

var ( 入 ’/?) = var(A / 6) 

=var(a’Xb) 

=var[a / X(X / X)-X / y] 

=a’X (X'X) 一 X'X(X'X) 一 1 X W 2 
— a’PxP^cacr 2 
=a’Pxaa 2 
=a / X(X / X)-X / aa 2 
=A’(X’X)- 入 a 2 . 

In words: the ^-inverse (X 7 X)~ acts as the variance-covariance matrix (apart from a 2 ) 
for estimable functions, independent of which 分 -inverse is used. 

We next give a very important generalization of the Gauss-Markov Theorem. 

Theorem 4.2 (The Aitken Theorem). Suppose y = X/3 + e • 五 (e) = 0,_E(ee')= 
V(j 2 with V invertible, then the BLUE X(3 of an estimable (3 is A b where b satis¬ 
fies the Aitken equation 

X'V-iXb = X^V-iy. (4.67) 



126 


CHAPTER 4. LINEAR MODEL THEORY 


Proof. Since Visa real symmetric matrix, there exists an orthogonal matrix O such that 


OVO = D = 


fd，i 

C?2 

0\ 

\o 

/ 


= diag(di,d 2 ,...,rfn). 


Then because V is an invertible variance matrix, the elements di(i = 1,..., n) are 
positive and hence have square roots, so we can form D 1 / 2 where we take the positive 
root always. Then 


V = ODO’ = 0D 1/2 0’0D 1/2 0’ = V 1/2 V 1/2 where (V 1/2 ) / = V 1/2 . 


Consider then 

V - 1/2 y = V - 1/2 X；3 + V - " 2 e, 

which is a linear model, and can be written (using obvious notation) as 


y* = X*/3 + e* 


with 

五 (e*) = 0, E{e*e* f ) = V- 1/2 VV _1/2 a 2 = la 2 . 

Hence the derived model is a GMLM. In this case 入 ’/3 is estimable if and only if there 
exists an a* such that a*’X* = Y or (a* / V~ 1 / 2 )X = A ; which is equivalent to 
a’X 二入 / with a! = a*’V _1 / 2 . We then apply the Gauss-Markov Theorem and we 
know that the BLUE of an estimable function 入 ’/? is A'b，where 

X* ， X*b = X*V 


or 

X，V 一 1 Xb = X'V—V 


4.16.3 Ordinary Least Squares and Best Linear 
Unbiased Estimators 

An interesting question is: When is the so-called ordinary least squares (OLS) esti¬ 
mator for an estimable 入 ’/3, obtained from (4.4), also BLUE, as obtained from (4.67 )， 
which is referred to as generalized least squares (GLS) estimator. 

We can address this very easily. If Xb = Pxy satisfies the Aitken equation (4.67) 
for all y, then the following equations result: 

XV^Px = X'V -1 
PxV 1 P x = PxV- 1 
PxV 1 = V^Px 
VP X = PxV. 



4.16. SIMPLE NORMAL STOCHASTIC LINEAR MODEL 


127 


Hence 

VX = XQ 

for some Q for the least squares estimator to be BLUE. Contrariwise, if VX = XQ 
for some Q, then 

VX = XQ 

implies 

VXB = XQB 
VPx = XQB 
PxVPx = VPx 
VP X = P X V 


or 


(where X，XB = X，） and 


so 


and by transposition 


V-iPx = PxV 1 

XV-ipxy = XP x V-V 
X / V~ 1 X(By) = X'V-iy. 


So b = By satisfies the Aitken equation and Xb = XBy = Pxy satisfies the Aitken 
equation. Estimable 入 ’/3 is a ; X/3 with OLS estimator a’Pxy which is the same as 
what the Aitken equation gives. 

The result given above is particularly important in the context of linear models 
for data from intervention studies. We shall show, starting with Chapter 6, that the 
derived linear models associated with the various error-control designs have a variance- 
covariance structure of the form Vcj 2 with V / I, but with V such that OLS estimators 
are, indeed, BLUE. 

In discussing the Aitken Theorem we have assumed that V is nonsingular. Without 
going into any detail we shall mention briefly the case where V is singular. Obviously, 
we cannot form the Aitken equation. It can be shown, however, that the BLUE of X/3 
is given by A f y, where 


VA + XM = 0 
XA = X 


for some matrix M. There is a considerable literature on this [Zyskind (1967)，Rao 
(1967, 1971 ， 1973), Kempthorne (1971 ， 1972, 1973a,b, 1976), Watson (1967)，and 
Kempthorne and Doerfler (1969)]. 

We shall merely state, without proof, 

(i) The OLS estimator and the BLUE are identical if and only if VX = XQ, and 

(ii) the class of matrices V such that the BLUE of an estimable 入 ’/3 as given by OLS 
is 

V = c 0 I + PxAPx + (I - Px)B(I- P x ), 
where A and B are arbitrary matrices such that V is a variance matrix. 
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4.16.4 Expectation of Quadratic Forms 

We can adjoin a few very useful ideas from our present basis. With the model: y = 
X/3 + e, with 五 (e) = 0 and E(ee f ) = V, we have for a fixed vector a, E(a’y)= 
a ; X/3. Also, we can obtain higher moments of the elements of y. The simple one is 
五 (y’Ay), the expectation of a fixed quadratic form. We have 

y ， A y = (X0 + e )'A(X/3 + e) 

and with E(e) = 0, 

E(y f Ay) = E(X/3) / A(X/3) + 五 (e’Ae) 

=AX0 + E[trace(e ， Ae)] 

=/3’X’AX/3 + E[trace(Aee / )] 

=AX/3 -f trace[AE(eeO] 

= (3'X f AXf3 + trace[(AV)] (4.68) 

If now V = cr 2 I, (4.68) becomes 

P ； X f AXf3 + a 2 trace A (4.69) 

and if A is symmetric idempotent, (4.69) equals 

AX/3 + a 2 rank A. 

This provides useful simple results on the vector of residuals, defined to be y - 
Fit(X/3). With least squares fitting this is y - Pxy. This residual vector has variance- 
covariance matrix equal to 

E(I- P x )ee / (I - P x ) = (I _ Px)V(I - P x ). (4.70) 

If V 二 a 2 I, (4.70) is a 2 (I — Px). The residual sum of squares is 

[(I - P x )y]，(I - Px)y=[(I- Px)e]，(I - Px)e 
with expectation equal to 

E trace [(I - Px) 2 (ee / )] = a 2 trace (I - Px) = cr 2 (n — rank X). 

4.17 DISTRIBUTION THEORY WITH 
GMNLM 

4.17.1 Distributional Properties of X/3 

By Gauss-Markov Normal Linear Model (GMNLM), we mean the stochastic model: 
y = X/3 + e, E(e) = 0. E(ee') = <r 2 I 
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and e 〜 A r n (0, a 2 I), that is e and hence y follows the multivariate (n-variable) normal 
distribution (MVN). We shall throughout use the notation y 〜 N n (fi. E) to mean that 
y is an n x 1 random vector that has a multivariate normal distribution with mean vector 
equal to \x and with variance matrix equal to S. 

The distribution theory associated with GMNLM is rather straightforward: 

⑴ The estimator 入 ’/3 of an estimable 乂 (3 = a’X/3 is a’Pxy = A / (X / X) _ X / y = 
//X’y, where p satisfies X ; Xp = A, and follows N[X!(3, a 2 p'X). Also p'A = 
A'X’X) 一 A. The equations X’Xp = A are called the conjugate NE. 

(ii) Suppose we have k linearly independent estimable functions 入 ; /3, i = 1, 2,.... /c. Th 

cov(Ai/3. Aj-/3) = cr 2 p 1 入 =<J 2 Kpj, 

where = Ai ， X’Xp)= 入 j. If we write 

W- {p'^) 

then with 0 f = ( 入 ’ ， •. ， X’ k j3) and corresponding 0, we have that 

e ^ N k {e,a 2 w). 

Additionally, 

- {e-eyw-^e-d) 2 

where is a random variable having the chi-squared distribution with k degrees 
of freedom. 

(iii) The estimator of any estimable function is p’X’y for some p. The residual vector 
is (I — Px)y. Then any set of linear functions {p-X^} and any set of linear 
functions {^(I — Px)y} being linear functions of multivariate normal random 
variables, have a joint multivariate normal distribution which is specified entirely 
by a mean vector and a variance matrix. But 

co+'XV, i/(I - P x )y) = Eip'X'eu'il - P x )e] 

= £'[p / X ， ee / (I-P x )i/] 

= a 2 p'X'(I - P x )" = 0 

because P x X = X ， X, = X’Px. So any linear function p’X’y and any linear 
function i/(I - Px)y are uncorrelated and hence independent. 

It is then natural to use the following terminology: 

The estimation space is the set of linear functions {p’X’y : p 6 R p }, and 

the error space is the set of linear functions {i/(I — Px)y ： v ^ R n }. 

Then any function defined on the estimation space is independent of any function de¬ 
fined on the error space. So the estimators of any set of estimable functions is indepen¬ 
dent of (I - Px)y = (I — Px)e and hence independent of y’(I — Px)y which is the 
residual sum of squares. 
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4.17.2 Distribution of Sums of Squares 

We still have to think about the distribution of the residual sum of squares and of 
any sum of squares in an ANOVA and the joint distribution of linear and quadratic 
functions. We can accomplish this task rather easily. Every sum of squares in our 
ANOVA is of the form y’S^y，where S- = is s.i.p. with roots that are 1 

or 0. More specifically, with s 二 fc + 1 sources, we have，using the notation for the 
projection matrices in Section 4.11.1 ， 

Si = Pi, S{ = P 12 … jzt, (i = 2, …， /c),S s = I - P 12 ...fc, 

with 

I = Si + S 2 + ... + S_5 

and 

SiSj = 0 (i ^ j). 

Hence by the standard theorem that with {S^S^Sj = SjS^} there exists a single or¬ 
thogonal matrix that diagonalizes every with s sources in the ANOVA there exists 
a single n x n orthogonal matrix 

O = (0i:02: … :o s ), 
where is n x = rank(Sj), such that 

O-S^Oi = I n . O-SjOj = 0, i ^ j. 

(see e.g. Harville, 1997, Section 21.13). The ANOVA comes from 

y = Siy + S 2 y-h-*- + S 5 y 
and then, using the fact that OCV = I n ， 

O’y = (0 , S 1 0)0 , y + (0S20)0y + + (OS s O)Oy. 

Because of 4.71 we have 

8 

D。％。) diag^Ch. 0 2 0 2! .... 0:0 S ]， 

i=l 

where diag[ • ] is a block diagonal matrix. Together with (4.73) this implies that O-O^ = 
I ri . Hence with z = O’y’Zi = O-y, we have, using (4.72) and (4.73) ， y’S^y = 
(0-y) / (0^y) = z-z^. Furthermore, zi, Z 2 , ...， z s are vectors that are distributed ac¬ 
cording to the multivariate normal distribution. Each will have a certain mean vec¬ 
tor; the variance of each vector is a 2 l of appropriate dimensions and the covariance 
matrix of different vectors is null. Hence the {z^} are independent. Clearly, then, 
{y’Siy = z^Zi} are independent. Finally, we know that if a vector z is N(u, V), that 
is, is multivariate normal with mean vector v and variance matrix V, then 


(4.71) 

(4.72) 

(4.73) 
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(i) (z — i/) / V~ 1 (z — v) is distributed as x 2 with degrees of freedom equal to the 
dimensionality of z, 

(ii) z’V 一 1 z is distributed as the noncentral chi-squared distribution with the same 
number of degrees of freedom and noncentrality parameter z/V _1 z/ [or \ of this 
with an alternative definition]. 

From this, we know the distribution under GMNLM of every sum of squares. In our 
case V = a 2 l of appropriate dimension. Hence, under GMNLM with sums of squares, 
S5i, 5^2,... ? SS s it is the case that 5Si/cr 2 is distributed with its associated degrees 
of freedom ， ri，according to the noncentral distribution. Also the separate such 
variables are independent. A particular sum of squares has notable behavior, namely, 
the residual sum of squares with y = X/3+e: This is equal to [(I — PxjyfRl —Px)y] 
but (I — Px)y = (I —Px)(X/3 + e) = (I —Px)e which has zero expectation. Hence 
we have, under GMNLM, 

SS (residuals) 2 

^2 〜 Xn-r, 

where r = rank(X). 


4.17.3 Testing of Hypotheses 

From the discussion above, we have the following as simple consequences: 


(i) The estimator of any set of estimable functions is distributed independently of 
the residual sum of squares. 

(ii) The estimator, 入 ’ 卢 ， of an estimable function 入 ’/3 is such that 


办 - Y/3 


where, as usual, p satisfies X’Xp = 入， s 2 is [sum of squares of residuals/(n-r)] 
and t is a random variable that follows the ^distribution with (n - r) degrees of 
freedom. 


vV 入 5 2 


(iii) 




t’ n _ r where t l n _ r is a random variable that follows the noncentral 


^-distribution with noncentrality equal to 


A ； /3 

\J p'\o 2 


(iv) With 0 r = (A^/3, A 2 /3, …， A^/3), a vector of m linearly independent estimable 

A A, -- — ' 、 - 

functions, with associated estimator 6 where 6 = (入 ’i/3, 入 ’ 2 ^, .. ■， 入 ’ 爪 /3 )， and 


A^ = P'XV, 
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where = 入 ^ and with 


it is the case that 


and 


V =( 必） 

0-0~iV(O,V<7 2 ) 


{e-eyv-^e-d) 


ms^ 


Fm 


,n —r ； 


where F m ' n _ r follows the F distribution with numerator degrees of freedom 
equal to m and denominator degrees of freedom equal to (n — r). 


(v) Furthermore, 

ms 2 

is distributed according to the noncentral F distribution with the same degrees 
of freedom and with numerator noncentrality equal to (0’V _1 0)/a 2 , or half of 
this with an alternative definition. 


(vi) For any sum of squares SSi(z = 1 ， 2,… ， A:) in the ANOVA table and SS(residuals) 
we have 

ss"n , 

SS(residuals )/(n — r) n,n-r’ 

where F^. n _ r denotes the non-central F-distribution with and n — r d.f. 

The random variables defined in (ii) and (iv) can be used to test hypotheses about a 
single estimable function and about a set of m estimable functions, respectively. And, 
finally, the statistic defined in (vi) can be used to test Hq ： UiUi/cr 2 = 0, that is, the 
noncentrality parameter for SSi equals zero (i = 1. 2...., fc). If Hq is true, then 
F^.^ n _ r = F ri ， n _ r ，the central F-distribution. 


4.18 MIXED MODELS 

4.18.1 The Notion of Fixed，Mixed and Random Models 

The general linear models that we have discussed so far are referred to, following 
terminology introduced by Eisenhart (1947 )， as fixed effects models or, for short, 
models. This means that, rewriting model (4.1) as model (4.74 )， the individual terms 
in in the model 

y = ^ +X*/3* +e (4.74) 

are (unknown) constants or fixed effects. This is the type of model most often used in 
explaining observations from experimental studies, as we shall describe in the follow¬ 
ing chapters. 
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There are, however, situations where we may want to partition /3* and, conformably, 
X* and rewrite (4.74) as 

y = 4 - + e (4.75) 

where the elements of j3{ represent fixed effects and the elements of /3^ are random 
variables having certain distributional properties. Model (4.75) is then referred to as 
a mixed effects model or, for short, mixed model. As an extreme case, model (4.75) 
with (5*2 = ^ is referred to as a random effects model since all the terms, except \i, 
in (4.75) are random variables. The other extreme, of course, is f3{ = f3*, the fixed 
effects model. 

4.18.2 Aitken-Iike Model 

For ease of notation we now rewrite (4.75) as 

y = /i3 + X/3 + Z 7 + e ， (4.76) 

where Z 7 represents the random part of the model. Since 7 and e represent vectors of 
random variables with E{^) = E(e) = 0 and 



var( 7 )= 

=£(770 = V 7 


and 



(4.77) 


var(e)= 

二 E(et') = \ e 



we can rewrite (4.76) as 

y = "3 + X/3 + e* (4.78) 

with 五 (e*) =0 and, using (4.77) and, assuming that 7 and e are uncorrelated, 

var(e*) = ZV 7 Z，+ V e = V*. (4.79) 

Obviously, V* of (4.79) is a real symmetric matrix and assuming that it is invertible 
it appears that we find ourselves in the situation of Theorem 4.2 (see Section 4.16.2). 
The difficulty, however, is that V* in (4.79) depends, generally, on more than one 
unknown parameter, namely in particular the variance and covariance components of 
7 in (4.76). 

Under these circumstances we cannot, obviously, obtain the equations (4.67). An 
easy way out of this dilemma is to estimate the unknown variance and covariance com¬ 
ponents and substitute these estimates in(4.67). In other words, we estimate V* in 

八* • 

(4.79) by V , say, and then solve the Aitken-like equation 

X' (v*)~ 1 Xb = X , (v*) _1 y. (4.80) 

八 * 

The solution of (4.80) depends, of course, on the type of estimation used to obtain V , 
for instance, ANOVA-type estimation. Thus, the solution to (4.80) is no longer BLUE. 
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4.18.3 Mixed Models in Experimental Design 

We conclude this section by giving a brief discussion of the occurrence of mixed mod¬ 
els in the context of intervention studies. Generally speaking, they do not occur very 
often. Referring to (2.3)，we recall that the essential parts of the linear model with 
respect to this question are the treatment effects, the design effects, and possibly treat¬ 
ment x design interaction effects. The design effects refer mainly to blocking effects 
as related to intrinsic or nonspecific factors (see Section 2.2.4). To the extent that the 
levels of a particular blocking factor can be considered to constitute a random sample 
from a larger population of such levels, the effects of that blocking factor may con¬ 
stitute random effects. As the treatment effects are always fixed effects, the above 
situation may thus lead to a mixed model, but never to a random effects model. We 
should emphasize here that if blocking effects are considered to be random effects, 
then also possible treatment x blocking interaction effects are also random effects. 

In its simplest form a mixed linear model can occur to describe data from a block 
design. In the model (see (9.5)) 

Vik 二 M + 汉 + A + Qfc 

the block effects, pi, may in certain situations be considered to represent random ef¬ 
fects. This is of particular importance for the so-called recovery of inter-block infor¬ 
mation in incomplete block designs, as described in detail in Section II. 1.7. The pi 
are considered to be i.i.d. random variables with mean 0 and variance a\. Different 
methods for estimating V* in (4.79) are described in Sections II.1.10 and II. 1.11. 

A somewhat more complicated situation arises when in a block design the block ef¬ 
fects are considered to be random effects and block x treatment interaction is included 
in the model (see (9.75))，that is, we have 

Vike = M + Pi + q + {3r) ik + e ike 

The problem arises with defining the distributional properties of the interaction effects, 
((3r)ik. A commonly used approach is to consider them to be i.i.d. with mean 0 
and variance a% T . A different approach based on finite population and randomization 
theory is presented in Section 9.7.5. Both approaches lead to the same result concern¬ 
ing inferences about the treatment effects, r^, but lead to different results concerning 
inferences about a\. For a general discussion of the controversy in the context of ob¬ 
servational studies we refer the reader to, for example, Hocking (2003), Lencina et al. 
(2005)，Nelder (1994, 1995) and Voss (1999). ^ 
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EXERCISES 

4.1 Prove that the NE X’Xb = X’y is consistent in b for all y. 

4.2 Prove that there exists a p such that X'Xp = 入 implies 入 ' =a’X and vice 
versa. 

4.3 Prove that the shortest solution of a consistent set of equations Ax = b is x = 
A+b. 

4.4 Verify that A - of Example 4.1 satisfies the properties of a generalized inverse. 

4.5 Verify that A+ of Example 4.2 satisfies the properties of a Moore-Penrose in¬ 
verse. 

4.6 Prove that the NE for 

y = x/3 

C/3 = c 

is given by (4.20). 

4.7 Prove the basic properties (i), (ii), (iii) and (iv) of the NE (4.20). 

4.8 Refer to Section 4.6.2 and prove that X’DX is s.i.p. and equals X[X(I — 

C + C)]+. 

4.9 Prove that 

=rank 

4.10 Prove that (4.22), with appropriate conditions on C, is a solution of (4.20). 

4.11 Consider the following balanced data structure: We have four factors A, B, C, 
D, where A and B are crossed, C is nested in AB and D is nested in C. 

(i) Draw a structure diagram. 

(ii) Give all admissible means. 

(iii) Write out an identity for the individual observation in terms of components 
obtained from the admissible means. 

(iv) Write a model in standard notation. 

(v) Impose side conditions on the parameters to remove overparameterization. 

(vi) Write out the ANOVA table. 

4.12 For the three-way cross-classification write out all possible well-defined models. 

4.13 (i) Give a definition for connectedness in a three-way cross-classification as¬ 

suming that all interactions are zero. 


， X 、 


- rank(C). 


rank 


H C A 


C 


0 
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(1,3,3 )， 
(2,3.2 )， 
(3,3,1 )， 
(4, 3:4 )， 


( 14 , 1 ), 
(2,1,4), 
(3,1 ， 3 )， 
(4,1,2), 


Show that the design is connected. 

(iii) Write out and solve the NE for the design in (ii). 

(iv) Write out the model for the design in (ii) in matrix notation and show that 
the sum of squares for any factor is independent of the order of the factors 
in the model. 

4.14 A complete set of linearly independent functions for the linear model y = X/3 + 
e is a set of rx linearly independent functions X f k 0(k = 1,2 ,..., rx), where rx 
is the rank of X, such that all estimable functions can be generated from this set. 

(i) For the linear model 

y%j = fJ> ai bj H - 

(i — 1,2,.... A: j = 1 ， 2 ’ … ， B) obtain a complete set of linearly inde¬ 
pendent estimable functions. 

(ii) Obtain a complete set of linearly independent estimable functions involving 
only the at (i = 1,2 ,.... A) and show that the sum of squares associated 
with this set is identical to SS(A) = 6E(^. —y..) 2 , the usual sum of squares 
for factor A. 


(ii) Consider the three-way classification with factors A, B, C, where each 
factor has four levels. Denote a design point by (z, j, k) where i.j, k = 
1,2,3.4 indicate the levels of the factors A, B, C, respectively. Suppose 
we have the following design points: 


4 3 2 


IX 


4.' 4 ， 4,' 

L' of3.4, 


2 14 3 

2,<>r2.2. 

1 .'2.'3,4. 





CHAPTER 5 


Randomization 

5.1 INTRODUCTION 

We have seen (see Chapter 4) that if we use the GMNLM 
y = X/3 + e e 〜 MVN(0 ， a 2 I )， 

then we can go through the panorama of conventional statistical ideas, that is, estima¬ 
tion of parametric functions, estimation of error, statistical tests, and statistical inter¬ 
vals. 

Insofar as we are merely studying mathematical statistics per se, we have completed 
the basic ideas. But our interest must surely be directed, in part, at least, to the use of 
the ideas in the “improvement of natural knowledge.” 

What are the problems in applying the mathematical material? First, we have to 
envisage a population of repetitions. Ordinarily, in substantive experimental science, 
the population of repetitions over which certain statistical or stochastic properties are 
to hold is defined by the experimental protocol. This will say something like the fol¬ 
lowing: If you do such and such, then such and such will happen. This is, of course, 
merely an assertion. In order to make the assertion one will have done “such and such” 
a number of times, one will have obtained results, and one will demonstrate that the 
data follow the model or class of models one asserts. Whether this will hold up for a 
new trial is. of course, mere speculation, hope or faith. The status is no better than our 
faith that the sun will rise tomorrow morning. This is the classical Humean problem of 
induction. 

5.1.1 Observational versus Intervention Studies 

Suppose now that we have observational data. To be specific, suppose we have ob¬ 
served two groups of humans. One group has been taking large doses of vitamin C 
for five years, and the other group has been taking no vitamin C over and above what 
is obtained in an “ordinary” diet. Suppose further, that we have a measure of the fre¬ 
quency, duration, and intensity of the common cold for each member of each group. 
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Our problem is then to set up a model for the data. What population of repetitions 
are we to assume? What are we to assume about the behavior of deviations from true 
values in this population of repetitions? The answers to these questions are not at all 
obvious. We shall surely be involved in model search, and the outcome of our study 
can have no tighter outcome than the assertions, for which we give, of course, our 
basis, that the data are “like” a realization of a stochastic model, say y = X/3 + e, 
with e 〜 MVN(0. cr 2 I), and that given this basis our estimates are such and such, our 
statistical intervals are such and such, and so on. 

The purpose of the remarks is not to denigrate observational studies. There are 
huge areas of human interest, for instance, astronomy and cosmology, in which we can 
clearly do no experimentation, though we can experiment on many of the scientific 
bases for our theories. At a more human and “down to earth” level, we cannot set up an 
experiment to determine the effects of cigarette smoking in humans. We cannot do an 
experiment to show that thalidomide produced in humans the awful effects we believe 
it to produce. We do have, in this case, very strong evidence that it does so. And, 
we can do actual experiments with other organisms, which we have strong reason to 
believe mimic essentially perfectly what happens in humans. 

We now turn from observational studies to experimental (or intervention) studies. 
We take, for discussion, a “simple” experiment in which the material to be experi¬ 
mented on is humans and the treatments to be compared are interventions to overcome 
the problems of heart disease. These interventions will be taken to consist of no inter¬ 
vention, cardizem, procardia angioplasty, and bypass surgery. The point of the experi¬ 
ment is that there is a subset of our human population which suffers from heart disease, 
and the problem is to obtain information on what the effects of the four named inter¬ 
ventions will be. We prefer the name intervention experiment rather than the vague 
general term, experiment. The point of the study is that the experimental units alter 
over time, and we wish to intervene in the dynamical process of each unit to produce a 
good outcome. 

In attempting to assess the different interventions, we shall call on all the available 
opinions, especially scientific experience that is available. However, the dynamical 
situation is so complex that we shall be forced to do an experiment, or, indeed, many 
experiments. 

Obviously, we shall need a number of experimental units. We shall set up a protocol 
for selecting candidates for experimentation. Obviously, each treatment must be a 
somewhat appropriate treatment for each candidate. 

We suppose that 20 candidates are available. We then have the problem of decid¬ 
ing how to allocate treatments to candidates — with the obvious requirement that each 
candidate can receive one and only one treatment. Suppose we have decided on a treat¬ 
ment allocation and we have imposed that allocation and conducted our experiment. 
We suppose that a treatment period has been chosen before the experiment. Then at the 
end of the experiment we have 20 doublets of, say, (z, t), where i is the name (or num¬ 
ber) of the unit (person) and t is the name (or number) of the treatment, and for each 
doublet the outcome, y(i, t) say, which we take to be a scalar. Our problem is easy to 
state: What conclusions can we draw about differences of treatments? It is obvious that 
treatment effects are confounded with unit effects. The problem is obvious in the case 
of two doublets (1,1) and (2, 2) with observations y(l, 1) and y(2, 2). We do not know 
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what y(l, 2) or y(2, 1) could be. So, if our data triples are (1,1, 20) and (2, 2, 10), we 
can conclude only that we cannot determine whether the difference between the yields 
of 10 and 20 occurs because there is no treatment effect and that unit 1 gives a response 
greater by 10 than unit 2, or that treatment 2 gives a yield 10 less than treatment 1, or 
that unit-treatment combination (1, 1) gives a yield of 20 and (2, 2) a yield of 10. How 
is one to get around this basic indeterminacy? 

5.1.2 Historical Controls versus Repetitions 

We can, perhaps, call on our past experience and say that a difference of 10 ( = 20 — 10) 
has occurred very frequently in observations on the same treatment. Or, we can say, 
from past experience, that a difference of 10 has never occurred under the same treat¬ 
ment. Perhaps we can condense our historical experience into a probability distribution, 
an empirical Bayes experience, that the difference between two individuals on the same 
treatment is distributed with mean /i and variance, 4, say. We would then have to say 
that we have observed a random variable, O say, with mean /i, the unknown treatment 
difference, and standard deviation equal to 2. We can then say that (O — fi) is an ap¬ 
proximate pivotal so that，according to Tchebycheff’s inequality, Prob{|0 - ii\> k2} 
is less than or equal to 1/fc 2 for any fc > 1. 

The procedures of the previous paragraphs are in a general category called “use of 
historical controls.” Undoubtedly, this procedure has been used very widely through¬ 
out the development of science，certainly in physics and chemistry. In the absence of 
appropriate controls, there is no alternative to requiring that a study contain its own 
controls. 

So we now ask if there is an experimental protocol that contains within itself a 
population of repetitions, and such that we may apply our probabilistic models to the 
resulting data and obtain conclusions, as regards statistical tests and statistical intervals 
which we may have faith in because of the experimental protocol we have followed. 

The suggestion of R. A. Fisher (1935) in this respect is the use of randomization. 
We shall follow to some extent the sequence of ideas that Fisher used. 


5.2 THE TEA TASTING LADY 

R. A. Fisher wrote his classic, The Design of Experiments，in 1935. He opened his 
exposition with the most famous experiment in statistical thinking, the lady-tasting-tea 
experiment. The lady of the Rothamsted Experiment Station staff claimed that she 
could discriminate between a cup of tea made with milk and one with tea added first. 
Fisher’s experiment design consisted of making 8 cups of tea with 4 made in one way 
and 4 in the other. The lady was told of this structure. The 8 cups are presented to 
the lady in random order and she has to partition the 8 cups into two sets of 4. The 
interpretation will be made on the basis that there are 70 [= 8!/(4!4!)] partitions. So, if 
the assignment was random, the probability of the lady obtaining the correct partition 
is 1/70, if she can not discriminate. Because 1/70 is a small probability, it is rational to 
conclude that if she obtains the correct partition she has given evidence in favor of her 
claim. 
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Fisher then discusses the experimental technique that must be followed if the prob¬ 
ability of 1/70, under the null hypothesis, can be justified. He mentions temperature of 
infusion and the nature of the cups, as two possible differences, which would invali¬ 
date the probability. He says these are only two possibilities from an indefinitely large 
number of such. If the 8 cups are prepared and laid out for presentation to the lady in 
positions 1 to 8, in 8! ways, indexed by i and we number the possible partitions by 1 to 
70, indexed by j, then there will be probability pij with = 1 of the lady choosing 
partition j. If the correct partition is chosen with probability of 1/70, the probability 
of the lady choosing the correct partition is 秦 [ =This probability will 
hold regardless of the nature of the cups, the method of preparation, and the method of 
presentation of the cups to the lady. The relevant part of the whole prescription is that 
the partition used has a probability of 1/70 regardless of the conduct of the experiment. 
So, if this partition is obtained after everything else has been done, the probability un¬ 
der the hypothesis of no discrimination ability is 1/70. This is true even if 4 cups are 
paper or bone china, or 4 cups inadvertently receive sugar, or whatever. 


The tea tasting experiment discussed above has a special structure in that the out¬ 
come of the experiment comes about because the taster has to make a comparison of 
the 8 cups of tea. The taster does not make a quantitative assessment of the properties 
of each cup. Fisher gives a discussion of the sensitivity of his design which is unsat¬ 
isfactory, as pointed out by Neyman (1950). Fisher uses a definition: One experiment, 
Ei, is more sensitive than another, E 2 , if E\ “will allow the detection of a lower degree 
of sensory discrimination, or, in other words, a quantitatively smaller departure from 
the null hypothesis” (Fisher, 1937, p, 25). For this idea to be implemented, we must 
be able to compare E\ and E 2 with respect to the deviations from the null hypothesis 
that they will express. In an attempt to convince us on Fisher’s ideas, he says that an 
experiment with 12 cups, 6 of each kind, is more sensitive than the one with 8 cups, 4 
of each kind. However, Fisher assumes that a difference from the null hypothesis will 
be the same in the two experiments. This is obviously not the case, because the task 
involves comparisons, and making a partition of 12 cups into two groups of 6, is more 
difficult than making a partition of 8 cups into two groups of 4, when there is the same 
difference between the two types of tea. In the tea tasting example, the only way to 
increase sensitivity is to repeat the study with the same design. So, for instance，with 
two repetitions, the probability of two successes is 1/4900 and of one success is .028, 
under the null hypothesis. 


5.3 TRIANGULAR EXPERIMENT 

It is also useful to mention a much smaller type of discriminatory test, the triangular 
test. The question is whether a “taster” can discriminate between two versions of a 
drink. The taster is presented with two specimens of drink A and one of drink B, 
and is asked to pick out the odd one. In this case，with the proper randomization and 
experimental technique, the probability of being correct in the absence of detection of 
a real difference is 1/3. This test has been used widely in the beer industry, for instance. 
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5.3.1 Medical Example 

Let us now adapt the ideas of the triangular test experiment to a more serious matter. 
The experiment we shall consider is a very small one, and if considered useful, it should 
be repeated many times. Suppose we are considering a disease in humans. The disease 
could be a minor one such as “the common cold” or a very serious one such as cancer. 
Suppose, furthermore, that we have two possible treatments. In the case of the common 
cold, these could be 

1. Go to bed for three days. 

2. Take drug X according to a prescribed regimen for three days. 

In the case of cancer the two treatments could be 

1. Undergo a regimen of radiation plus a potential anticancer chemical. 

2. Do the same as 1 with a different potential anticancer chemical. 

The protocol will be that we decide to give one of the two treatments to one of three 
patients, and the other treatment to the other two patients. We will then have the patients 
examined by a doctor with a prechosen observation protocol and whatever additional 
observations he may choose, and this examining doctor is to specify which of the three 
patients received the odd treatment. The examining doctor will be given, of course, no 
information on which patients received which treatment. This example is interesting 
because of several features which contrast with the beer tasting situation. In that case, 
we can easily envisage obtaining three glasses which are so nearly identical that one 
glass would not be sufficiently different from the other two to cause it to be chosen 
as the odd one. In contrast, our three patients will be not ‘‘nearly identical” by any 
stretch of the imagination. They will surely differ in age, in their whole backgrounds, 
their weights, their dietary intakes, their personalities, and so on. We, the applier of the 
treatments, may have unconscious biases. The doctor who examines the patients at the 
end of the experimental period may have unconscious biases. He may think that the 
appearance of a particular symptom or reaction to treatment is significant. The situation 
is, formally, analogous to the beer-tasting situation, with the presence of a high amount 
of data which may be entirely irrelevant, but that may influence the judgment of the 
evaluator. It is obvious, we suggest, that the randomized triangular test is a candidate 
design for the study. The end result is, of course very simple. Did the evaluator pick 
out the odd treatment correctly? To assert the force of the whole experimental design, 
we adduce the simple probabilistic fact: the probability — and a frequency probability 
at that — that the evaluator makes the correct choice of the “odd” patient, under the 
hypothesis that he cannot really discriminate is 1/3. 

5.3.2 Randomization ， Probabilities，and Beliefs 

Some will attempt to argue, perhaps, that with such a small experiment, randomization 
is unnecessary. To do so is to fail to see the nature of the logical argument that is being 
followed. It could be that the evaluator will pick one patient out of the three to be the 
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odd one for reasons related only to the patients. What then is the probability that he will 
pick out the odd one? There can be a probability only if there is a probability that any 
particular individual is in fact the odd one vis-a-vis treatment. One cannot determine 
probabilities without injecting probabilities. One can calculate a probability only on 
the basis of probabilities of elementary sets. 

The argumentation above has been criticized particularly by some representatives 
of the neo-Bayesian school of statistics. Their idea is, it seems, that without the slight¬ 
est use of randomization in the protocol, the rational person may have the belief prob¬ 
ability of 1/3 that the evaluator will pick out the odd one correctly in the absence of 
there being a treatment-induced basis for discrimination. An assertion of this sort is 
absurd. Who can formulate such a belief probability of 1/3 without an understanding 
of the psychophysical processes of the evaluator? It would seem that what is being 
called on is an assumption of ignorance, namely that with three possible decisions the 
evaluator will make any one of these with a probability of 1/3. One may mention a 
variety of problems. The medical evaluator knows, of course, the disease; he may have 
the idea that fair-haired people are different in their reactions to the disease from dark¬ 
haired people, and he may choose the odd one on this basis. In fact, the study of how 
the evaluator picks out the odd one could be a huge investigation per se. One would 
have to do the following sort of investigation of the evaluator. One would convince him 
that we have done a real experiment, when, indeed, we have not. The evaluator would 
then pick out the odd one, and we could collect a set of data, consisting of attributes of 
each triple of patients and of the patient he chooses as the odd one. One could then at¬ 
tempt to determine how he does, in fact, pick out the odd one. We close the discussion 
of this view with the remark that the force of randomization is accepted throughout 
experimental sciences in which there is unavoidable variability between experimental 
units. 


5.4 SIMPLE ARITHMETICAL 
EXPERIMENT 

5.4.1 Noisy Experiments 

We now consider a class of experiments of the following nature. We have N exper¬ 
imental units, which may be mice, men, plots of land, pieces of steel, 8-hour time 
segments of a functioning chemical reactor, “pieces” of the lower atmosphere, such as 
clouds, or whatever. We have t treatments, one of which can be imposed on any one 
experimental unit. A treatment will then be imposed on each experimental unit and 
after a chosen period an attribute of the experimental unit will be observed, and we 
suppose this attribute to be an arithmetical or interval measurement, such as height, 
weight, percent conversion, tensile strength, or whatever. Let us label our units by 
i (i = 1,2,..., A T ) and our treatments by j (j = 1.2.... .t) and our final observation 
by y'ij • Clearly for given i, we shall have only one j represented. We may represent 
our possible data by a two-way table: 
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Treatment 

Unit 1 2 … t 



Now it is obvious that within any row of this table, we shall have only one cell occupied. 
Suppose that we have an observation yi 3 , that is, unit 1 received treatment 3. Our task 
is to try to understand the observation 2 / 13 . Suppose t /13 = 21. Then we know that 
this is a function of the unit and of the treatment. Consider, for example, the following 
additive model (see Chapter 6 ): 

21 = 5 + 16, 

by which we mean that the unit value is 5 and the treatment contributed 16. Or, for 
instance 

21 = 26 - 5 ， 

by which we mean that the unit contributed 26 and the treatment contributed minus 5. 
Obviously, we cannot distinguish among these possibilities, and indeed we can write 

21 = Wi + 亡 3 ， 

and we can see that there is an infinity of values for u\, each with an associated value 
for ts which satisfy this equation. 

If, of course, one could assert that without treatment, unit 1 would have given an 
observation of 24, say, one could then assert that the observation yis = 21 tells us 
that 亡 3 equals minus 3. In non-noisy sciences one may well be able to be confident 
in taking such a view. But, suppose our observation is weight at age 6 of a child 
entering the experiment at age 5. Obviously, one cannot be assertive about what a 
child of age 5 will weigh at age 6 . Or to take a harder example, suppose we have a 
strain of mice or men, such that say, 20 % develop a certain disease by a certain age. 
Then our task, in order to use this sort of argument, is to state what the status of each 
particular individual will be with respect to the disease at the age of post-experiment 
observation. In the case of mice or men to consider doing this effectively is simply 
ludicrous. Contrariwise, if the experimental unit is a carefully prepared test tube with 
well-defined and well-determined contents, and the question of what the status of that 
test tube will be in, say, 30 minutes, presents no problems. This latter example is a case 
from a nonnoisy science. Though we have to realize that the deeper the investigation 
of any scientific field the more noisy it becomes. So the distinction is not between 
different fields of science in their basic natures, but between different fields of science 
at particular levels of noise. The noise level in basic physics and chemistry which is 
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now at high school or beginning college level, is very low. In the determination of the 
weight of an electron it is higher but controllable. In radioactive decay, it may be high 
and completely uncontrollable. 

5.4.2 Investigative Experiments and Beliefs 

Let us broaden our consideration of the data we might get. Suppose we have 6 units 
and 2 treatments and observe 

Ui 2 = 22. 2/21 = 14, y 32 = 30, = 18, y 51 = 24, y 62 = 42. 

We see that the average for treatment 1 is 181 and for treatment 2 is 31^. Are we then 
to infer, surmise or guess, that treatment 2 gives a result greater than treatment 1 by 
12|? How indeed, may we make such a surmise? We may make the following sort of 
statement: We have looked at the set of 3 units which received treatment 1 and the set 
of 3 units which received treatment 2, and we believe that these two sets would give 
nearly the same means if, in fact, there were no treatment effect. Or one might surmise 
the following: We believe that the difference between the two means in the absence of 
a treatment difference would have been no more than 4; so we believe the effect of the 
treatment is somewhere between 81 and 16|. 

We suggest that while our beliefs should be given some weight, one simply does 
not know how much weight to give them. We have well-trained and well-intentioned 
investigators who exhibit a wide panorama of beliefs. The task of investigative experi¬ 
mental science is to remove the role of personal belief that cannot be validated in some 
way. And the fact that an individual has had a good record of his prior beliefs being 
sustained by investigation is a weak straw (but in many cases, let it be said, the only 
straw) on which to base our own outlook. Perhaps some examples, without possible 
citation, should be given: 

(a) Some years ago, a highly trained and successful medical worker had the very 
strong belief that stomach ulcers in humans could be cured by freezing the stom¬ 
ach for a period. 

(b) For several years, some high experts had the strong belief that birth control pills 
were entirely without risk. 

(c) For many years, some scientists with excellent records have had the strong belief 
that cloud seeding does, in fact, produce rain. 

(d) Some well-trained scientists have the strong belief that the eating of high-cholesterol 
foods does not increase the risk of heart disease. Other well-trained scientists are 
convinced of the opposite. 

(e) Some well-trained scientists believe strongly that present-day uses of weed killers 
and insecticides are not producing a poisonous environment for all organic life. 
Other very well-trained scientists believe totally the opposite. 
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To continue with such a list would be rather tedious，but we do suggest to the reader, 
and particularly any neo-Bayesian who happens to read the above material, to construct 
his or her own list of problems on which well-trained, seriously-intentioned scientists, 
who should be given some partial degree of credence, have widely opposing views. It is 
true, of course, that at the end, for example, after some tens, hundreds, and thousands 
of years, enough data will have been accumulated to cause scientists to agree. But 
after that passage of time, other questions will have arisen on which there is the same 
diversity of belief. So the neo-Bayesian answer that with infinite data, all reasonable 
people will agree has truth, but we must counter this with the fact that the essence of 
science is that the questions at issue change over time. The problem is not to reach the 
correct answer with infinite data; the problem is to make an approach to truth, and to 
avoid the prejudices and biases that we inevitably accumulate. 

5.4.3 Randomized Experiments 

How are we to tackle the dilemma of interpretation of the 6 observations we mention 
above? To some, and indeed to Fisher，the answer is obvious. We must use a random¬ 
ized design; we are to select from the 6 units at random 3 units which are to receive 
treatment 1 with the remaining 3 units to receive treatment 2. The basic idea is, after all, 
rather natural. Underlying our investigation is a 2 x 6 table of potential (or conceptual) 
observations: 


Unit 



1 

2 

3 

4 

5 

6 

Mean 

Treatment 1 

yn 

2/21 

2/31 

"41 

2/51 

yei 

y.i 

Treatment 2 

yi2 

V 22 

2/32 

2/42 

Vb2 

V62 

y .2 


Our task is to form opinions about the difference between the average of the first row 
and the average of the second row, with the restriction that we can observe only one 
number in each column. It is natural, then, to select 3 elements from the first row 
and this then determines a set of 3 elements in the second row. We now do a little 
elementary mathematics of finite population sampling. Let us consider the total for 
treatment 1 under the sampling. There are in fact 6 numbers, yn, i = 1, 2,.... 6, and 
we select at random 3 of these. Call the average of the sample Y \. Then Y\ is a random 
variable, and we know 


E(Yi)=y.i 



Similarly, with ¥2 equal to the average under treatment 2, we have 


E{Y 2 ) = y. 2 
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and 

/■O 、 /6 - 3 、 ▽( 扒 2 - 5.2 ) 2 

—y 2 ) = ( 了 —-— . 

\ ’ i 


Here we use the basic formulae for the expectation which is obvious, and variance 
of a sample mean with sampling from a population of N finite numbers {x{.i = 
1,2,..., A r } which is 



(Xj - x .) 2 
N -1 


We are not through, however. We cannot use 


var(?x — 巧） =var(Yi) + var(F 2 ), 


because the samples under treatment 1 and under treatment 2 are not independent. 
This gives us a reason to exposit a little more elementary theory of finite sampling. 
Let & = 1 if unit i receives treatments 1; = 0 otherwise (z = 1. 2...., 6). Then our 
estimator of the difference 3(y.i — y. 2 ) is equal to 

T = Siyn + ^ 2/21 H - + ^eV6i 

— [(1 — Si)y\2 + (1 — ^ 2 ) 2/22 H - + (1 _ ^ 6 ) 2 / 62 ] 

=^^(yn + y i2 ) -&y. 2 - (5.1) 

i 

Under random sampling we have the following properties: 

E(Si) = I E{S^) = I ， var^) = i = 1.2, … ，6 (5.2) 

= (I x I) = I 

cov(5i, 如 ) = \ ~ \ — ( 以 O. (5.3) 


We see this because 5^5^, is 1 or 0 and is 1 only if treatment 1 falls on i, which has 
probability and then given that treatment 1 falls on i, treatment 1 falls on i' the 
probability of which is 2/5. So we have, using (5.1) ， (5.2), (5.3), 

E{T) = 3(y.i - y.2) 

var(T)^var|^ Si(yn + Pi2) 

=\ + Vi2 ) 2 - ^ JoiVil + Vi2){yi f l + "i'2). ( 5 . 4 ) 

i i,i , 


Let 


Yi. = Uii + Ui2 = 1 , 2 ,..., 6 ). 
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Then (5.4) can be written as 

var(T) = j EdE 砂 . 

l i.i 1 

= \ ~ ^ ( y2 ~ H Y i. 

i \ i 

= I) H y t 2 - 



Now suppose that treatments have additive effects, that is, the number with unit % and 
treatment j, which we have denoted by yij, is made up additively of a unit effect, Ui 
and a treatment effect Tj (see also Section 6.3.1); that is, 

Vi j = • 

Then 

= 2ui + ti + r 2 (5.6) 

Y. = I2u. + 6 ri -h 6 r 2 (5.7) 

and we see that by substituting (5.6) and (5.7) into (5.5) 

var(T) = )(4)E (^-^) 2 

\ / i 

= 6 — u.) 2 /5 = 6cr 2 

i 

if we define 

o 2 = - u .) 2 / 5 . 

i 

Finally, since Yi ~ Y 2 = it follows that the variance of the difference of the 
treatment means is 

var(Fi - Y 2 ) — ^ 6 cr 2 = 2(cr 2 /3). 

This result is formally the same as we get with the model 

Vij = P + 丁 j + e.ij 

with the e{j having the Gauss-Markov properties. 

Also, elementary computation shows that the expectation under randomization of 
the mean square within treatments is equal to the quantity (or parameter) a 2 . 

A single such small experiment cannot, of course, tell us a lot. We can certainly 
imagine repeating this experiment a number of times. For each repetition we will have 
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an estimated mean difference and a standard error, an estimated standard deviation of 
this estimated difference. Then we will look at the collection of these results. Obvi¬ 
ously, we may apply the Central Limit Theorem to infer that the average of a number of 
such experiments will be normally distributed, and we may apply ordinary tests (using 
normality) on the average, with a standard error based on the variability between the 
mean differences for the separate experiments. 

As regards the assessment of the significance of an observed mean difference in 
a single such experiment we will use a procedure called the randomization test pro¬ 
cedure. Before we give a general description of the randomization test we shall trace 
briefly the ideas of randomization and the resulting test procedure as developed by 
R. A. Fisher. 


5.5 RANDOMIZATION IDEAS FOR 
INTERVENTION EXPERIMENTS 

The test procedure is an outgrowth by Fisher of his discussion entitled “The Arrange¬ 
ment of Field Experiments,” which was published in the Journal of the Ministry of 
Agriculture in 1926. The ideas are expressed in terms of agricultural experiments, nat¬ 
urally, because Fisher was then statistician of Rothamsted Experiment Station. This 
paper should be read by all students of experimental design. In the field experiment, 
treatments are applied to plots of land, and the questions considered are how this as¬ 
signment should be made and how results with different treatments should be evaluated. 
The reader should realize that the arguments apply to any interventional comparative 
experiment: for example, on humans, on animals, on engineering material, and so on. 

Fisher bases his whole argument on the use of tests of significance. He asks (Fisher, 
1926, p. 503): “When is a result significant?” and then “What is meant by a valid 
estimate of error?” He discusses what may be called the use of historical controls, dis¬ 
missing them in the context of agricultural experiments (erroneously, to some extent, 
we judge). He states (p. 504): “A scientific fact should be regarded as experimen¬ 
tally established only if a properly designed experiment rarely fails to give this level of 
significance.” To obtain a valid estimate of error, he advocates the use of replication, 
that is, the occurrence of the same treatment on different plots. He says that we wish 
to quantify the differences between plots with different treatments. We do this by ob¬ 
taining the estimate of error from differences between plots that are treated alike. This 
estimate “will only be valid if we make sure that in the plot arrangement, pairs of plots 
treated alike are not distinguishable from pairs of plots treated differently” (p.506). 
This prescription is, however, not realizable. In the case of the field experiment, in 
addition to the positions of the plots, each plot will have many attributes, e.g,, nature 
of soil, pH, amount of N, P, and K, and so on. Each plot thus will be representable as a 
point in a space with a large number of dimensions. The Fisher prescription described 
above requires “pairs of plots treated alike be not distinguishable from pairs of plots 
treated differently.” Fisher says: “An experiment either admits a valid estimate of error 
or it does not: whether it does so or not, depends not on the actual arrangement of plots, 
but only on the way that arrangement was arrived at” （ p, 508). So “If the arrangement 
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ABB A ABB A was arrived at by writing down a succession of sandwiches ABBA, it 
does not admit of any estimate of certain validity” （ p. 508). Furthermore, according to 
Fisher: “If the same arrangement happened to occur subject to the condition that each 
pair of strips shall contain an A and a B, but that which came first shall be decided 
by the toss of a coin, then a valid estimate may be obtained from the four differences 
in yield in the four pairs of strips.” He continues: “Thus validity of estimation can be 
guaranteed by appropriate methods of arrangement...” （ p. 508). Later, he says: “ex¬ 
periments capable of genuine tests of significance can” easily be designed to be very 
much more accurate than any experiments ordinarily conducted” （ p. 508). 

We have found the early writings of Fisher discussed above at best obscure and not 
entirely coherent. 

The beginning of Fisher’s arguments lies with the use of significance tests. Unfor¬ 
tunately, Fisher never made clear his ideas on this: the obscurity on this has plagued 
statistics for the past 80 or more years. Fisher was fond of (obscure) classical theory of 
errors. He did not make a clear distinction between what we call observational studies 
and interventional (comparative experimental) studies. We are concerned in this book 
only with the latter. 

Fisher made his ideas of 1926 more clear in his book The Design of Experiments. In 
this book, he pursued his ideas on randomization and made the statement: “The purpose 
of randomization in this, as in the previous experiments exemplified, is to guarantee the 
validity of the test of significance, this test being based on an estimate of error model 
possible by replication” (Fisher, 1937, p. 71). Singularly, in connection with the Latin 
Square design (see Chapter 10) he says: “The purpose of randomization, necessary 
to ensure the validity of the test of significance applied to the experiment, consists in 
choosing one at random of the set of squares which can be generated from any chosen 
arrangement” (Fisher, 1937, p. 80). 

Fisher discusses estimation of error and tests of significance by means of analy¬ 
sis of variance. He continues his treatment by use of tests of significance based on 
the comparison of certain mean squares in the analysis of variance he considers to be 
appropriate. He continues his discussion with a treatment of “systematic squares” of 
which the prime example is the Knut Vik square: 


A 

B 

C 

D 

E 

D 

E 

A 

B 

C 

B 

C 

D 

E 

A 

E 

A 

B 

C 

D 

C 

D 

E 

A 

B 


The point of this square and of the name is that the positions occupied by any treatment 
are given (nearly) by the knight mode in chess. Actually, this is not achieved: consider 
A which is in the sequence of cells, (1,1), (2,3), (3,5), (4,2), (5,4). The move from (3, 
5) to (4, 2) is not a knight move. “In this arrangement, the areas bearing each treatment 
are nicely distributed over the experimental area ， ..(Fisher, 1937, p. 87). He says: 
“The total ... ascribed to treatments and to error” is “independent of the experimental 
arrangement.” He continues his discussion with the remark, “The failure of systematic 
arrangements came from not recognizing that the function of the experiment was not 
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only to make an unbiased comparison, but to supply at the same time a valid statement 
of its significance” (Fisher, 1937, p. 89). 

In fact, the Knut Vik square given above is one of two possible, apart from treatment 
names, so it will be seen that the only test by analysis of variance combined with 
randomization can give levels of significance of 50% or 100% only. 

The way out of the dilemmas was given by Fisher (1937, Section 21) without real 
understanding. He says: “In these discussions it seems to have escaped recognition that 
the physical act of randomization, ..affords the means ... of examining the wider 
hypothesis in which no normality is implied” (Fisher, 1937, p. 51). His procedure 
is the use of the randomization test, which is totally related to the randomization that 
was used. 

We now turn to this, which in Fisher’s words shows the possibility of an indepen¬ 
dent check on the more expeditious methods in common use. 

5.6 GENERAL IDEA OF THE 

EXPERIMENT RANDOMIZATION TEST 

We first describe the particular example of Fisher. He has 15 pairs of cross-fertilized 
and self-fertilized plants and the differences in yield between the former and the latter. 
On the assumption that the members of each pair have been applied to pairs of sites at 
random, the 15 differences would have occurred with equal frequency with a positive 
or with a negative sign. The observed total difference was 314 and in the total of 
2 15 possible (=32,768) arrangements this difference was equaled or exceeded in 863 
cases. The difference in absolute magnitude would be exceeded in 1,726 cases, so the 
significance level by this procedure is 1.726/32,768 = 5.267 percent. This may be 
compared with 5 percent given by the normal theory based t test. 

Turning now to comparative experiments, we suppose that we have t treatments that 
are to be compared using N experimental units. We decide that we shall have units 
for the ith treatment with = N. We wish to determine the acceptability of the 
hypothesis that the treatment differences {r^ —= 6ij}. We can, obviously, adjust the 
observations according to the hypothesis so that all observations arise under treatment 
1. For example, if the hypotheses are t\—T2 = 3, n - ts = 5,ri -r 4 = 一 4, etc. (with 
being the treatment effects) then the original data y U i (u denoting the experimental 
unit and % the treatments) are adjusted as follows: 

y* Ul \ = yu x \-, y:2 = yu 2 2 - s 

V* Ui z = Vu 3 3 - 5, y* Ui4 = y U4 4 + 4 

etc. We then evaluate the resultant data y* { with respect to the null hypothesis that the 
treatment differences are null. 

The logical basis for the test is that if the hypothesis {ri — r 3 = Oij} is true, then 
after the adjustment the data should be a realization of the data we would observe if 
there were no differences in treatment effects. Hence, we can apply the test of this null 
hypothesis to the adjusted data. 



5.7. INTRODUCTION TO SUBSEQUENT CHAPTERS 


151 


What test criterion should we use? We suggest that a good criterion is given by least 
squares. We shall by randomization choose one of a set of experimental plans. We shall 
compute the test criterion for the actual plan, equal to C a , then we shall compute the 
criterion for each of the plans of the set and shall compute the significance level with 
respect to the hypothesis, {t{ — Tj = ^}, as 

- [numbers of C > C a ], (5.8) 

s ’ 

where s is the logical number of possible plans. 

A simple and instructive example of the randomization test for a design discussed 
above with ^ = 2, A^i = A ^2 = 4 is described by Kempthorne (1952, p. 130). Using 
(5.8) with s = 70, the significance level for the randomization was found to be .71 
which compares favorably with the significance level of .63 using the usual F-test for 
the one-way analysis of variance. This is an important point to which we shall return 
later (see Chapter 6). 

5.7 INTRODUCTION TO SUBSEQUENT 
CHAPTERS 

We have given above the basic ideas and philosophy of randomization in a comparative 
experiment illustrating some basic aspects with the completely randomized design. For 
further discussion of intervention experiments, randomization, and inference we refer 
the reader to Kempthorne (1992). 

We shall in Chapter 6 give more detailed theory of randomization for the com¬ 
pletely randomized design. We shall discuss estimation of differences of treatment 
effects, estimation of error, and tests of significance. This is in strong agreement with 
the views of Cox (2006, p. 192) who states: 

“Randomization has three roles in applications: as a device for eliminating 
biases, for example from unobserved explanatory variables and selection 
effects; as a basis for estimating standard errors; and as a foundation for 
formally exact significance tests.” 

We shall then give some mathematical and simulation results about the approxima¬ 
tion of the randomization test by the corresponding F-test in the analysis of variance. 

Then we shall repeat the same line of development for increasingly more com¬ 
plicated structures such as the randomized complete block design (Chapter 9)，Latin 
square design (Chapter 10)，and designs that are of split-plot nature (Chapter 13). One 
important feature of this approach will be to show how the physical act of randomiza¬ 
tion influences the statistical analysis of data from various experimental designs and 
how, as a consequence, randomization determines what statistical inferences can be 
drawn from such data and how it should be done. 
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CHAPTER 6 

Completely Randomized Design 


6.1 INTRODUCTION AND DEFINITION 


The simplest error-control design for comparative experiments is the completely ran¬ 
domized design. Its use and usefulness is predicated by the availability of a set of 
homogeneous experimental units (EU) (for the description of an EU see Section 2.3). 
The word “homogeneous” in this context should not be interpreted too narrowly. As 
explained earlier, there do not exist in nature identical EUs and hence homogeneous 
here means “alike to the extent possible,” Even that phrase is quite relative. The vari¬ 
ability among EUs that arise naturally, for instance, humans of a given gender, within 
a certain age range, with a certain disease, will be much higher than the variability 
among EUs that have been manufactured, for example, test tubes under controlled con¬ 
ditions. And yet, in both situations the use of a completely randomized design may be 
quite appropriate. The implications, however, will become evident as we discuss the 
nature of this design in more detail. 

We shall now give the formal definition of the completely randomized design and 
discuss in subsequent sections the randomization procedure, the derived linear model, 
tests of hypotheses, and ’’sample size” considerations. 

Suppose we have t treatments and N = tr homogeneous EUs. Let the tr EUs be 
partitioned randomly with equal probability into t sets of r EUs. Let the t treatments 
be assigned to the t sets such that the zth treatment is applied to each of the r EUs 
in the ith set (i = 1,2,..., t). This procedure defines the completely randomized 
equal replication design for t treatments. A realization from this protocol is called a 
completely randomized equal replication experiment. In what follows we shall discuss 
mainly the equal replication situation and hence we shall refer to such a design simply 
as a completely randomized design (CRD). 

It is clear from this definition that one has a randomized design if and only if one 
has randomized the assignment of the treatments to the EUs. We shall now describe 
the randomization process more formally and show how such a mathematical charac¬ 
terization leads to the formulation of a linear model and to the analysis of data from 
such an experiment. 


153 



154 


CHAPTER 6. COMPLETELY RANDOMIZED DESIGN 


6.2 RANDOMIZATION PROCESS 

6.2.1 Use of Random Numbers 

The randomization process for the CRD can be described in various ways which are 
equivalent to the following: 

Label the EUs 1 ， 2, … ， A r . Make up chips labeled by A: = 1 ， 2, … ， iV. Draw 
a chip after shaking and label it (11). Discard the chip. Draw a chip after shaking 
and label it (12). Discard the chip. Draw another chip after shaking and label it (13). 
Continue this process and thereby establish a random correspondence of 1, 2,. •., iV 
with the tr labels 11,12, . • • ， lr, 21, .. • ， 2r ， ... ， , tr. If chip k, and hence EU 
k, is associated with the label (if )，apply treatment i to this EU. More precisely, this 
will be the jth application of treatment i. 

This is, of course, equivalent to establishing a random association of the numbers 
1 ， 2, ... ， iV 二 tr to the set of tr labels 11,12,.... ir by using a table of so-called 
random numbers. As an example, suppose iV = 24 and t = 6, r = 4. Then with 
two-digit random numbers we may discard all the numbers except 01 ， 02,..., 24. This 
will prove very tedious. Instead, we may discard 00, 97, 98, 99. We then associate the 
random numbers 1, 2, 3, 4 with the EU number 1, the random numbers 5, 6, 7, 8 with 
the EU number 2, and so on. If we get a repetition of the associated EU number, we 
ignore it. So if our random numbers are 

07, 21，34, 65, 43, 22, 05, 83, 77, ••• 

our associated EU numbers are (ignoring repetitions) 

2, 6, 9, 17, 11, 6, 2, 21 ， 20,… 

and the associated labels are then 

11, 12, 13, 14, 21, 22, 23, 24, 31’ … 

Consequently, EUs 2, 6, 9， 17 receive treatment 1, EUs 11 ， 21， 20 receive treatment 
2, and so on. Even this process will become tedious, and one may have to alter the 
algorithm at one stage to pick a random member of the unselected subset. 

An alternative procedure is to use a computer program such as given, for example, 
by SAS PROC PLAN (SAS Institute, Inc., 2002-2003), as illustrated in the following 
example. 

Example 6.1: For t = 4, r = 2 the SAS input statements are given in Table 6.1a. 
and the actual design (presented in two different forms) is given in Table 6.1b. We note 
here that the seed number used in Table 6.1a. can be chosen freely. □ 


6.2.2 Design Random Variables 

The whole randomization process can be expressed mathematically as follows. Let 
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If, for instance, 符 2 = 1， that is, EU 2 receives label 12 and hence treatment 1， then 
Sh = S 21 = 622=0 and also 5\ 2 = 8\ 2 = 巧 2 = 0. This is so because if EU 
2 receives label 12, then EU 2 cannot receive any other label and label 12 cannot be 
associated with any other EU. 口 

Hence, we have, in general, 

E d t = 1 ， = 1 

k ij 

expressing the fact that one EU receives label ij and EU k receives only one such label. 
This implies, for example, 

P(^. = = 1) = 0 (ij) ^ 

确 == 1 ) = 0 (ij)^ (i f f) 

P((5^- = 1|<5 怎 ’ =1) = 0 k ★ k’. 

Probabilities involving three random variables , S^j,, can also be derived eas¬ 

ily, and so on. All one needs to recognize is that the {J&} are simple Bernoulli (0, 1) 
random variables and that they are identically but not independently distributed. We 
have a peculiar, but highly structured dependence. 


J 1 if EU k is associated with label ij 
13 1 0 otherwise, 

such that the following probability statements, denoted by P(.), hold: 




p {^ 3 = i) 
p(4 = = 1 ) 

1 . S^ijf = 1 , 1 ) 


l/N. 

l/N(N - 1), 
\/N[N — 1){N ‘ 


ky^k', 

• 2 ), 




k ， k’ ， fc^unequaL (ij), [i’j% {i n j n ) unequal, 


and so on. 

The variables S^(k = 1,2..... N = rt:i = 1,2,... ,t\j = 1, 2,… ，厂 ） are called 
design random variables. There are (rt) 2 such random variables. Obviously, these 
have many dependencies among them as illustrated in the following example. 


Example 6,2: Consider t = 2.r — 2. N = 4. Then we have 


122222;3c 
ro 5 rG ro 

1111 
12223242 
ro ru Xu 

2 2 2 2 
' ― -1— -2 13 1—141— f 

ro 5 Co ro 

1111 
i — 11213141 — _ 

ro 5 ro 
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Table 6.1 Randomization Procedure for CRD 


a. ) Input Statements: 
proc plan seed= 17683; 
factors unit=8; 

treatments treat=8 cyclic (1 1 2 2 3 3 4 4); 
output out=CRD; 

title 1 ’COMPLETELY RANDOMIZED DESIGN (t=4, r=2, N=8)’; 
run; 

proc sort out=CRD; 

by unit; 

run; 

proc print; 
run; 

proc sort out=CRD; 

by treat; 

run; 

proc print; 
run; 

b. ) Output: 


Factor 

treat 


COMPLETELY RANDOMIZED DESIGN (t=4, r=2, N=8) 


Factor 

unit 


The PLAN Procedure 
Plot Factors 

Select Levels Order 

8 8 Random 


Treatment Factors 


Select Levels Order 


Initial Block 
/ Increment 


Cyclic (11223344) / 1 


treat 


62387415 


11223344 
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Table 6.1 (Continued) 


Obs unit treat 

1 1 4[h] 

2 2 1 

3 3 2 

4 4 3 

5 5 4 

6 6 1 

7 7 3 

8 8 2 


Obs unit treat 

12 1 
2 6 1 

3 3 2 

4 8 2 

5 4 3 

6 7 3 

7 14 

8 5 4 


We shall now use the mathematical formulation and the statistical properties of the 
randomization procedure to derive a linear model for the observations from a CRD 
together with their distributional properties, following Kempthorne (1952; 1955). 


6.3 DERIVED LINEAR MODEL 

6.3.1 Conceptual Responses and Observations 

If EU k receives the label ij, then treatment i is applied to EU k. At the end of the 
experimental period an observation is made which we denote by yij. It is, in fact, 
the observation on the jth occurrence of the ith treatment. We may suppose that if 
treatment i is applied to EU k the true (or conceptual) response is a number, say. 
We suppose that if we could, in fact, impose every treatment on every EU, we could 
observe the totality of numbers {Tik}. But we cannot do this as we can apply only one 
treatment to each EU and that is determined by the randomization process. Using the 
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design random variables, we can then link the and the as 

N 

Vi ： = J2 5 iJ Tik - ( 6 - 1 ) 

k=l 

This states that if EU k receives the label (/, then we observe T^. The {TU} are fixed 
numbers under repetitions of the randomization. This is just a t x iV array of numbers, 
and we may write the identity 

Tik = T.. + {Ti, — T..) + (Tfc — f..) + {Tik — Ti, — T\k + T..), (6.2) 

where T.. is the overall average of the T^; is the average of all conceptual responses 
for the ith treatment; is the average of all conceptual responses for EU k. 

We shall now assume that 

J\k =Ti~\~Uk 、 (6.3) 

that is, the response of treatment i applied to EU k is made up additively from a contri¬ 
bution due to the ith treatment, and a contribution due to the fcth EU, Uk. We refer 
to this as additivity in the strict sense. It follows then from (6.3) that 

T.ik - fi. - f.k + f= 0 

and hence (6.2) reduces to 

Tik = T.. + (f^. — T 7 ..) + (f.A ： — f..) 

which, using (6.3), we rewrite as 

T ik = (f. + U.) + {Tt - f.) + (U k - U.) (6.4) 

with T. being the average of all treatment contributions T“ and U. being the average 
of all EU contributions Uk. Letting 

ji = T. -\- U., T{ = Ti — T., Uk = Uk — U• 



(6.5) 


( 6 . 6 ) 
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where we have used the property that = 1. Let 

^ij = 〉 ： • (6.7) 

k 

Then, finally, yij can be written as 

Vij — f-l Ti LVij . ( 6 . 8 ) 

We refer to (6.8) as the derived linear model associated with the CRD. 

6.3.2 Distributional Properties 

In (6.8) the only random variable on the right-hand side is u ： ij. Its distributional prop¬ 
erties are determined entirely by those of the S^. Denoting the expectation operator E 
and the variance and covariance operator var and cov under the randomization model 
as Er, var 丑， and cov_r ，respectively, we obtain first 

Er^j) = j T 

vaa ( 硌） = 办瞒 ) 2 ) - [E R (S_ 2 

丄（丄 V 

N \N 


N \ N, 


cov ^5^,6^,) 


k = k', (ij) ^ {i'f) 
k / k 1 , (ij) = (i'f) 
(ij) ^ (i’/). 


_ AT2(iV - 1) ^ r 

Using these results we obtain further 

EnM = J2E R (S^)u k 

k 

= = 0, 

k 

V&X R (LOij )= ^2ya.T R {S^)ul + ^ cov R (S^,S^) UkUj^f 


nV~n 


Yl u2 k~ UkUk， 


N ) N 2 




J2 u l 
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since 芊 k r UkUk f = Defining 


E 雜 r -i) 


( 6 . 10 ) 


we then write (6.9) as 


varH(c^) = (1 - erg. 


Similarly, we find for ij ^ i’j’ 




( 6 . 11 ) 


( 6 . 12 ) 


Recall that Uk = Uk~ U. denotes the deviation of the contribution of the kth EU from 
the average contribution of all EUs. Then can be interpreted as a measure of the 
variability among the EUs, that is, the heterogeneity of the EUs. Also, may then be 
referred to as unit error. 

We note here parenthetically that the above covariance structure is interesting as 
an example for which the simple least squares estimators are best unbiased estimators 
(see Section 4.16.3). 

From the results above and the model (6.8) we can easily derive 

E R [yij) = " + n 
E R {yi,) = /i + r, 

with yi. = mjVij, 


vai R {yij ) 二 v&r R (u!ij) 


var H (R) 




r 1 


v iv 广 

jr)<r 2 u-r(r-l)^a 2 u 


JSh 


COV R (H) = — yd (i # 0. 


(6.13) 

(6.14) 


If we consider a contrast among the treatment effects say EjCjTi with = 0, we 
find immediately that Y^iayi, is an unbiased estimator for that contrast, that is, 


Er 


(? 


with 


var_R 


E 


c iVi. 


C.iyi. 


Y^ ciTi 

i 

E 碎 


(6.15) 


using (6.13) and (6.14). 
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6.3.3 Additivity in the Broad Sense 

Our discussion up to this point has been based entirely on the model (6.8) under the 
assumption of additivity in the strict sense. We have mentioned earlier (see Chapter 2) 
that associated with each observation are two error components: experimental error 
and observational error. The only error we have encountered so far is the unit error cjij 
associated with the observation yij. The unit error is part of the experimental error, 
but in order to incorporate other error components we must broaden our model and our 
assumptions. Let 

yik = T ik + M ik = Ti + f7/c + Mik (6,16) 

denote the conceptual observation from EU k to which treatment i has been applied. 
We shall refer to (6.16) as the model under additivity in the broad sense. The compo¬ 
nent expresses what we might call technical error (Wilk and Kempthorne, 1956). 
This includes: 

(i) treatment error; that is, error due to our inability to replicate a treatment from 
one application to the next; 

(ii) state error; that is, error due to random changes in the physical state of an EU; 

(iii) selection error; that is, error due to the random selection of EUs for the experi¬ 
ment; 

(iv) measurement error; that is, error due to imprecision in our measurement or scor¬ 
ing procedure; 

(v) sampling error; that is, error due to the random selection of observational units 
(OUs) for the investigation. 

We may consider the errors (i), (ii) and (iii) as part of experimental error, and errors 
(iv) and (v) as observational error. Accordingly, it is convenient to partition as 

= Eik + (6.17) 

to reflect these two components. Using (6.17) and (6.5) we then rewrite (6.16) as 

十 H - EiJ^ H - Oik • ( 6 . 18 ) 

This is now the conceptual response of applying treatment i to EU k. The actual obser¬ 
vation yij can then be modelled, following (6.1), as 

Uij = 

k 

=".+ E S^jUk + Z + ^ ^ijOik 

k k k 

= ".+ Tj + ^ij ~\~ Tjij . 


(6.19) 
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say，where 


u ij = 也 Eik ， 

k 

Vij = ^ij Oik - 

k 

As an illustration of the error structure as described above we consider the follow¬ 
ing example. 

Example 6.3: Suppose we want to compare different spraying regimens for peach 
trees in an effort to increase the yield and improve the quality of the fruit. We have 
available an orchard consisting of trees of the same variety and same age. For each of 
the t regimens we randomly select r trees. The trees are then sprayed at a specified rate 
on several specified occasions throughout the growing season. We can then identify the 
various error components as follows: 

(i) treatment error: even though the rate is specified for each tree and occasion 
the rate may not be achieved exactly and/or the spray may not cover the tree 
uniformly; 


(ii) state error: the trees may grow differently during the growing season due to 
different micro climates such as wind, moisture, sun exposure; 

(iii) selection error; different trees could have been included in the experiment; 

(iv) measurement error: the judgement in assessing the quality of the individual 
peaches may not be quite uniform; 

(v) sampling error: typically only a few peaches per tree are judged for quality, they 
are picked at random and hence different peaches could have been selected. □ 


6.3.4 Error Structure 

The quantities and Oi/c are random variables with mean zero. Furthermore, the 
and are statistically independent. Hence 

E(Vij) = E(rjij) = Q 

and 

var(^) = a 2 v 
vax(7 ?ij ) =： 

say. It then follows that 

E (jjij )= 以 + Tf 
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and 


var(^)= 



Furthermore, as an extension of (6.13) and (6.14) we have 


var ( 仄 .)= -( 丄 -y) 4+4 + 4 

cov(H) = —+ 4 (i ^ i') 
so that, corresponding to (6.15), we find, for ^ Ci = 0, 


var 


/ 




\ i 


( 6 . 20 ) 


Expressions (6.15) and (6.20) for var(Sci 识 •）are the same as would have been ob¬ 
tained if the were uncorrelated with variance We shall, therefore, from now 
on, mainly to facilitate the computation of variances and other functions of the obser¬ 
vations yij, treat the Uij as if they were independently, identically distributed (i.i.d.) 
random variables with mean zero and variance o\. Since and z^j are components 
of experimental error it is then also convenient to combine them into one term and 
define the random variable 

= L’ij + 

to be the experimental error with 


E{dj) = 0 

and 

var(y) =(J 2 S = crl + al, 

that is, we may consider the Sij, for purely practical reasons, also as i.i.d. random 
variables. It follows then from (6.19) that 


= m k 

and from our earlier discussion that 


var( 2 / 勿 ■) = a^ + a^, 

where cr^ is referred to as the experimental error variance component, and as the 
observational {sampling) error variance component. To condense the notation even 
further we shall find it usually convenient to use a single error term 


e ij 


~ e ij 


+ 


Vij 


with 


var(ey) = ^ = a 2 e + cr 2 ^ 


(6.21) 
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6.3*5 Summary of Results 

We can summarize our discussion up to this point as follows: 

(i) Under the assumption of additivity in the strict sense, the randomization process 
leads to the derived linear model 

Uij = ^ 的 j Tik 

k 

= YjU Ti + lJ k) 

k 

-/i•+n+ s^uk 

k 

=+ Tj + UJij • 


(ii) The u)ij have mean zero and a simple covariance structure 






(iii) A treatment contrast Eci^ is estimated unbiasedly by the same contrast in the 
treatment means, that is, He 办 . with 


vaxi? E 帥 . 


(iy) Under the assumption of additivity in the broad sense, the model in (i) is amended 
by technical error components to a partly derived, partly assumed model 


with 


Vij ~ 

k 

^Y, 5 iji T > + u k + E ik + o ik ) 

k 


coy{yij,yi'r) 


E[Vij) — ^ T i 

_ 去） 4 
~Jj a l 


i}j) = (i'f) 
(ij) ★ {i'f) 


(v) As an extension of (iii), for Ci = 0, 

E ( 〉： 讲 .)— 〉 ： cj 丁 i 

var ^ ^ cj{al +a 2 v +al j )/r 
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(vi) Because of the result in (iii) and (v) we treat, in the appropriate context, ujij as if 
they were i.i.d. (0. cr^) and write the experimental error as 

三 f = (jj^j -|- 

with 

五 (_ _ 

var(eij) = al+<rl= al 
covicij^ej') = 0 (ij) # (i’j’). 

(vii) The overall error, experimental and observational, is 

— ^)ij 

with 

E(eij) = 0 
var(e 勿） =a 2 £ + 

and, as explained above, we can treat the such that 

cov(eij.ei'f) = 0 {ij) ^ (i'f). 

(yiii) Useful expressions for var(E-Cj^.) are 

var = ^cf(o-2 +cr 2)/ r 

i 

To make further inferences about treatment comparisons beyond point estimation 
we need to consider questions of interval estimation or tests of significance (see Sec¬ 
tions 6.5-6.7). 

6.4 ANALYSIS OF VARIANCE 

6.4.1 Deriving the ANOVA Table 

As indicated earlier, one of the most important tools for analyzing data from designed 
experiments is the analysis of variance. For data from a CRD, the analysis of variance 
(ANOVA) is based on the model 


Vij ~ M H - ^ij 


( 6 . 22 ) 
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as developed in Sections 6.3.3 and 6.3.4. In linear models terminology this is a one¬ 
way classification model and the analysis of variance table is as given in Table 6.2 (see 
also Section 4.12). 

We shall comment in some detail, mainly to lay the groundwork for future chapters, 
on the ANOVA Table 6.2 and its various parts. 

The number of entries (sources) is determined by the number of components, apart 
from /u, in the model. In this case the model is given by (6.22) and contains two terms, 
due to treatments (Tj) and error (e 勿 ). We note here that even if we had written (6.22) 
in its more explicit form (6.19), the number of entries in the ANOVA table would have 
been two, since for an actual data set the various error terms cannot be separated (see 
also Section 2.6). The corresponding partition of the (corrected) total sum of squares, 
SS(Total) can be obtained by writing the following identity: 

Vij = y.. + {Vi. - y.) + (yij - m.) 

or 

(yij - y..) = (Vi. - V..) + {yij - m.)- (6.23) 

Squaring both sides of (6.23) and summing over both subscripts, we obtain, using the 
fact that Eijiyi. - y..)(yij - yt.) = 0, 

- y-f = E(n ) 2 + Ylbhj - 勾 if 
ij ij ij 

or ^ 

- y..) 2 = r — V ..) 2 + [(yij - Vi .) 2 ； (6.24) 

ij i ij 

which is indeed 

SS(Total) = SS(T) + SS{E)., 

where SS(T) and SS(£) refer to the treatment and error sum of squares, respectively. 
The partition (6.24) exhibits two things: 

(i) If we substitute in each term on the right-hand side of (6,24) for the model 

(6.22), we recognize that — 历 .） 2 is a quadratic function in the e 勿 only 

and T^i(yi, — 穸 ”) 2 is a quadratic function in the and the , hence the names 
SS (Error) and SS (Treatments), respectively. 

(ii) Since 1^( 免 .— y,) — 0, this sum contains only 亡 一 1 (mathematically) indepen¬ 

dent terms which accounts for the t — 1 degrees of freedom (d.f.) associated with 
SS(r). Similarly, in - yt)} we have for every z, E〆 如一 仏 .） = 0. 

Hence each such sum contains r — 1 independent terms and hence the number 
of d.f. associated with SS(_E) is t(r — 1). 
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Turning to the expected mean squares, E(MS), the reader will notice that we have 
given two forms: one based on the assumption of additivity in the strict sense [model 
(6.8)], and the other based on the assumption of additivity in the broad sense [models 
(6.19) and (6.22)]. From a practical point of view, that is, for real data sets from a 
CRD, only the latter is important, but by exhibiting both forms we want to demonstrate 
their similarity and show that whether we use the covariance structure (6.11) and (6.12) 
of the or treat the tOij together with the uij and r]ij as i.i.d. random variables, we 
obtain equivalent results, that is, a\ in the first form is simply substituted by a\ = 
<7^ + = CTg + a^. This may be a subtle and philosophical point, but it is 

an important one in the transition from purely derived models to partly derived, partly 
assumed models. 


6.4.2 Obtaining Expected Mean Squares 

There are different methods of obtaining 五 (MS). One is to substitute for the ys in the 
expression for the mean square the linear model and then evaluate the expected value 
of that expression. We shall illustrate this for E[MS(T)] under model (6.8). We have 


Now, 


and 


SS(T) =r^(y,. - y..) 2 = r^(r I + 也 一 f - w..) 2 . (6.25) 


亍. = 


Q - = ^ s if uk 

ij ij k 


tr 


tr 


E E4 

k \ ij 
> : — 0 * 


Uk 


Then, (6.25) apart from the multiplier r can be expanded as follows: 
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+ ㉔ .) 2 = E f Ti + 去 E % 


H \ Ti+ 


Uk 


j k 


+ EE^ + 紅 

i i \ j k j ij k 

z 6 ij s ir u i 


Uk 


ij k 


3^3' k 


+ ^2 S iO S ij UkUk> + ^2 H S^d^UkUk' 

i j ky^k' i j 爹 j r k^k' 




(6.26) 


ij k 


Taking the expected value of the right-hand side of (6.26) and using the properties of 
the we obtain 

e r {e( t 《 +a.) 2 } = E r i 2 + ^ r -^E u fc 

l i ) i k 

+ ^ tr{r ~ 1) Nl^T)^ k UkUk， 

i k k 

= J2 T i + - 1 - r + 

i k 

= y^ji + 

i 


where we have used the fact that A r = tv, = 0, and Eu|/(A r — 1) = g\. It then 
follows that 


E r {MS(T)} 


■ J2(yi. -y.f/it- 1 ) 


Yl T i +<y u 


as given in Table 6.2. 

Another method to find 五 (MS) is to use the fact that for any random variable X, 
E(X 2 ) = var(X) + [E{X)] 2 . 
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We shall illustrate this method also for E[MS(T)] but this time under model (6.19). To 
this end, we consider 


Now 


and 


E{(yi. - y..) 2 } = var(y<. - y..) + [E(yi, - y..)} 2 . 


var(yj. - y..) 



i'W , 





(6.27) 


E{m. - y.) = n, 


(6.28) 


where we have used the fact that the can be treated as i.i.d. with mean zero and 
variance a\. It follows then immediately from (6.27) and (6.28) that 

E{MS(T)} = a 2 e + 


as given in Table 6.2. 

The expected value of MS(E) can be obtained similarly. It follows then from 
Table 6.2 that an estimator for a\ is 

= MS(£?). (6.29) 

And hence the standard error for the estimator of the treatment contrast Ec^ri is given by 


s.e. 



s.e. 


X 


CiVi. 



(6.30) 


As has been shown in Chapter 4, the ANOVA table can be used to test hypothe¬ 
ses about parametric functions in the context of the underlying linear model (see also 
Chapter 7). We shall examine the ideas of testing hypotheses about treatment effects 
using observations from the CRD in the next section. 
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6.5 STATISTICAL TESTS 

The mathematical description of the randomization process together with the assump¬ 
tion of treatment-unit additivity in the strict sense，has allowed us to derive a linear 
model the properties of which are determined by the very process. It also determines 
the properties of functions of the observation as we have seen, for example, in the anal¬ 
ysis of variance. We shall now go one step further and show how the randomization 
process, together with the analysis of variance, leads to a simple procedure for test¬ 
ing hypotheses about treatment effects. We shall give this development in some detail 
so that the reader can see immediately the application and extension to more complex 
designs to be discussed in later chapters. 

6.5.1 Enumerating Randomizations 

The outcome of our completely randomized assignment is that we have a plan associ¬ 
ating treatments to EUs. We then perform the experiment and obtain a data table which 
may look like this, for example, 


EU# 

1 

2 

3 .. 

. N 

Treatment 

3 

1 

4 •• 

. 7 

Response 

1/32 

yi3 

Vai _ • 

■ U7A 


where x/ij refers to the response for the jth application of treatment i. We wish to 
consider the hypothesis that the treatments have no differential effects, that is, we would 
observe the same response on an EU regardless of which treatment has been used. The 
experimental plan we have used is a random one of 


N\ 


_ 妙 

(r!)* (r!) t 


(6.31) 


possible plans (assignments). Let us name and index these plans by n 7 (7 = 1,2,.... s). 
Then if treatments were without differential effects, we would have obtained exactly 
the same result, that is, the same responses, for each n 7 . We can thus visualize data 
sets for the s plans as given in Table 6.3. In sketching this table we have inserted 
treatment plans that could have been used (the numbering is arbitrary). One of these 
plans is, of course, the plan we have actually used, say TIi. The observations from the 
experiment using Hi are labeled zi, Z 2 ^ . z n . Under the null hypothesis that there are 
no differences among the treatment effects the same observations would have been ob¬ 
tained from any of the other plans. For purposes of the analysis the observations would 
then have to be relabeled indicating for each Zi (£ = 1, 2,.. N) which treatment had 
been applied to the £-th EU. For example for IIx we would have zi = y 2 i ， 之 2 = ysi ， 
Z 3 = ysi, ..zj\r = y lr (assuming that the treatments are applied sequentially to the 


EU, starting with EU 1 and ending with EU N). 
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Table 6.3 Possible Outcomes for CRD 


EU# 

1 

2 

3 •• 

. N 

Response 

之 1 

幻 

zs • • 

. z N 

Plan 1 

2 

5 

3 . 

.. 1 

2 

1 

2 

2 .. 

. 4 

3 

3 

1 

4 .. 

. 2 

s 

1 

5 

1 •. 

. 4 


6.5.2 Randomization Test 

We now construct a criterion that is determined by the treatment plan whether actual or 
potential. A criterion depends on the treatment plan if and only if when the observations 
are indexed by (ij) with % denoting the treatment and j the replication (application) 
within the treatment, it is invariant under permutations of j within i. The variety of 
such criteria is, of course, essentially unlimited. In order to exposit the idea we give 
some examples: 

(i) the sum of squares for treatments, SS(T), in the ANOVA table ， 

(ii) MS(T)/MS(E), 

(iii) the range of treatment totals, 

(iv) the range of medians of the treatment groups (supposing there are no ties and an 
odd number of applications of each treatment), 

(v) the sum of squares (or range) of trimmed treatment means, 

(vi) the sum of squares (or range) of Winzorized treatment means, 

(vii) the sum of squares of robust estimates of treatment means. 

Having chosen a criterion C, say, we are able to evaluate C for all possible plans 
II 7 , giving us a set of numbers {C 7 ,7 = 1. 2,..., s}. Actually, there are only s* = 
s/(t\) different values C, since each permutation of the treatment labels yields the 
same value for C. We denote the 5* different values by C 7 *. Amongst these is the 
number associated with our actual plan, which we denote by C a . Then we declare 
that the significance level (SL) against the null hypothesis of no differential treatment 
effects, with the chosen criterion, is 

SL 二 - [number of C 7 () = 1, 2,..., 5 ) > C a ] (6.32a) 

SL = — [number of C 7 * (7* = 1.2,. .., s*) > C a ] (6.32b) 
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(Note that the number of C 7 on the right-hand side of (6.32a) and (6.32b) includes C a , 
so that always SL > 1/s*). 

We then assert: Under the null hypothesis of no differential treatment effects if a is 
an achievable level, which will be a multiple of 1/s*, the probability of obtaining a 
significance level less than or equal to a is a. This assertion is obvious since under the 
null hypothesis we are observing with probability 1/s* one of the numbers of the set 
{C 7 *,7* = 1 ， U}. ^ ' 

The procedure just described is called the randomization test (see Chapter 5). We 
illustrate it with the following example. 


EXAMPLE 6.4: Suppose we have t = 3 treatments and r = 3 replications for each 
treatment. Let our plan, IIi say，and the observed responses be as follows: 

EU 1 2 3 4 5 67 8 9 

Trt 1 3 2 1 2 1 3 3 2 

y 7.58 11.61 9.97 8.56 11.03 8.82 10.32 11.73 10.06 

This plan is one out of s = 9!/(3!) 3 = 1680. Using C = SS(T) as our test 
criterion we obtain C a = SS(T)i = 13.2956. This is one of s* — 1680/6 = 280 
different values for all possible values. It is not difficult to write a computer program 
to enumerate all possible 1680 plans and their associated 280 different SS(T)-values. 
As a result we obtain for the significance level using (6.32a) or (6.32b) 


SL = 


12 

1680 = 


2 

280 


.00714 


Inspection of the plan IIi above suggests that in this case it is not necessary to spell out 
all different plans in order to obtain SL: it is clear that the largest SS(T)-value, using 
the ^-values above, is obtained from the plan 


EU 123456789 
Trt 1 3 2 1 3 1 2 3 2 


which has Treatment 1 associated with the 3 lowest observations and Treatment 3 with 
the 3 highest observations, providing thus the largest mean separation among the treat¬ 
ments and hence the highest SS(T), namely 14.8623. The plan above is obtained 
from the plan IIi by simply interchanging the treatments assigned to EUs 5 and 7 and 
thereby interchanging the third and fourth highest observations. As a consequence IIi 
then leads to the second largest mean separation and hence to the second largest SS(T). 
Since each plan has five additional permutations with the same SS(T) the result for SL 
as given above follows immediately. □ 

Unfortunately, such simple arguments are not always possible. This means that 
usually all plans have to be enumerated and each (7- value computed in order to obtain 
the significance level. 
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6.6 APPROXIMATING THE 
RANDOMIZATION TEST 


We saw in the previous section that even for small t and r the number of possible plans 
s under randomization is quite large. Further evidence of the rapid increase in s for 
moderate values of t and r is given in Table 6.4. 

These numbers indicate that even though it is possible today, in the computer age, 
to spin out all possible plans and proceed with the randomization analysis as described 
in Section 6.5, the procedure becomes rather cumbersome. It is in this context that 
we shall discuss an approximation to the randomization test by the F-test (see also 
Kempthome, 1955) as suggested by the GMNLM theory (see Chapter 4). 


6.6.1 Moments of the Test Statistic 


We note first that under the null hypothesis, Hq : ri = T 2 — .r t = 0 (remember 
from Section 6.3.1 that [ 7^ = 0), using the results of Sections 6.2.2, 6.3.1，and 6.4.2 


Table 6.4 Number of Experimental Plans 


Treatments 

⑷ 

Number of 
Replications 

(r) 

Plans 

(s) 

Different C-Values 

(，） 

2 

4 

70 

35 


5 

210 

105 


6 

924 

462 


7 

3,432 

1,716 

3 

3 

1,680 

280 


4 

34,650 

5,775 


5 

252,252 

42,042 

4 

2 

2,520 

105 


3 

369,600 

15,400 


4 

63,063,000 

2,627,625 

5 

2 

113,400 

945 


3 

168, 168,000 

1,401,400 
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SS(T) + SS(E) = SS(Total) 


= 公斯 -歹 ..) 2 
ij 

= 


ij \ k 


2 


= J2J2 S iJ u2 k + J2Y1 S iAj u kUk' 

ij k ij k 袖 ’ 

=^2 u l 

k 


is a constant and, by definition, equal to (N - 1)(7^. Instead of using SS(T) as our test 
criterion we could, therefore, just as well consider 


__ss(r^_ 
- ss(r) + ss(E) ‘ 

We know that under GMNLM theory the quantity 


r 二 MS(T) 

— MS(E) 

is distributed as (see Chapter 4). It is then a fact that 

(t - 1)F _ SS(T) 

t{r - 1) + ( 卜 1)F = SS(r) + SS(E)= 

follows a beta distribution with density 


f{z)dz = 





dz 


(6.33) 


for 0 < 2 < 1, that is, a beta (a, 3) with a = (t — 1)/2, (3 = t(r — 1)/2 (see Johnson, 
Kotz and Balakrishnan, 1995, p.327). From the properties of the beta distribution we 
know that if a random variable X is beta (a, /3), then 


F(X k ) = _ (a + k - 1 )! (q + - 1 )! 

1 卜 ~ (a + /3 + /c- 1)! ' (a"-1)! — 


In particular, we have 



176 


CHAPTER 6. COMPLETELY RANDOMIZED DESIGN 


E(X 2 ) 


、广 a + 

a(a + 1) 

(a + /3)(a + /3 + l). 


We now consider the first two moments of Z, our test criterion, under GMNLM theory 
and randomization theory. Since Z, as defined in (6.33), is beta [(t-1)/2, t(r —1)/2)}, 
it follows from (6.34) that 


and from (6.35) that 


E{Z 2 ) 


y tr - 1 

(t — 1)( 亡 + 1) 

(tr — l)(tr + 1 ) ’ 


We now consider Er(Z) and Er(Z 2 ). We have already shown (see Section 6.4) that, 
under the null hypothesis 


E r [SS(T)\ = (t- l)a 2 u . 


It follows then that 


Er(Z) 


tr — 1 


To work out 五只 [SS(T)] 2 and hence Er(Z 2 ) is rather tedious and lengthy. We shall 
not give a derivation but use a result from Richards (1980) which yields 


E r [SS(T)} 


'2(t-l)(r-l)(N 2 -3N+ 3) {N - r) 2 
r(N - 1) 2 (N - 2)(N - 3) + r 2 (n — l ) 2 

2iV(r — l)(t — 1) ^ 4 

r(N - l)(iV - 2)(N - 3) ^ k ' 


For large r, in the sense that 1/r and hence \IN are small compared to 1, (6.39) can be 
approximated by 


f-r 2 

[E< 

m u) 2 〆. 


It follows then that 


Er{Z 2 ) 


(t - l)(t + 1 ) 
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6.6.2 Approximation by the F-Test 

Comparing (6.38) with (6.36) and (6.40) with (6.37) we can say that the distributions 
of Z under normal theory and under randomization theory are in good agreement with 
respect to the first two moments. We take this to mean that the randomization distri¬ 
bution of Z is “fairly accurately” represented by the beta distribution. This implies 
that the randomization distribution of MS(T)/MS(£^) is fairly accurately represented 
by the F-distribution with t - 1 and t(r - 1) d.f. It is in this sense that we consider 
the ordinary F-test for testing the hypothesis of no differences among the treatment 
effects as a good approximation to the randomization test discussed in Section 6.5, We 
mention here that these ideas and results go back to Fisher (1935), Pitman (1937), and 
Welch (1937). 

It should be pointed out that the preceding discussion was based entirely on the 
derived linear model based on the assumption of additivity in the strict sense. As 
discussed earlier (see Section 6.3) for practical applications the model obtained under 
the assumption of additivity in the broad sense is more realistic. The question then 
arises: How should we test the hypothesis of no treatment differences under this model? 
There does not seem to be an easy way, if any at all, to derive a result for model (6.22) 
analogous to the one just derived for model (6.8). Nevertheless, we take the results of 
this section as a strong indication that the usual F-test as suggested by the ANOVA is 
an appropriate test procedure. 


6.6.3 Simulation Study 

The arguments given above for suggesting that the randomization test can be approx¬ 
imated by the usual F-test is of course, not entirely satisfactory. The following ques¬ 
tions remain: (i) What happens for small r, and (ii) to what extent does agreement 
of the first two moments imply agreement of both distributions? Although we cannot 
provide an analytical solution, we can give some indication that, indeed, the agreement 
between both distributions is quite good in general. 

We shall illustrate the argument in terms of a simple example. Suppose we have 
t = 4 ? r = 8. The total number of randomizations is s = 2.39 x 10 24 . Except 
for very powerful computers it is a nearly impossible task to enumerate all possible 
randomizations as described in Section 6.5, hence approximation of the randomization 
test by the F-test appears to be the only practical solution. To demonstrate that this is, 
indeed, a reasonable approach we conduct the following simulation experiment. Out 
of all possible randomizations we select at random s f = 500 randomizations of the 
4 treatments to the 32 EUs. Assigning the (arbitrarily chosen) responses 0, 1, 2, 3, 
to 8 EUs each we then compute for each of the 〆 arrangements the quantity F = 
MS(T)/MS(£：), denoted by 尸⑴， P 2 ), ..., 尸 ( 500 ). For each = 1,2,..., 500) 
we then obtain the significance level in two ways: 

(i) based on the F-distribution with 3 and 28 d.f., 

(ii) based on the rank among all 500 jF- values as explained in (6.32a). 


We denote these significance levels by NSL and RSL, respectively. A plot of RSL 
vs. NSL is given in Figure 6.1. It shows that both significance levels are in “close” 
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Table 6.5 Comparison of Significance Levels 


Rank 

F-Value* 

NSL 

RSL 

1 

8.65863 

0.00032 

0.002 

2 

6.55319 

0.00170 

0.004 

3 

4.88889(2) 

0.00740 

0.008 

5 

4.62305(2) 

0.00949 

0.012 

7 

4.49383(4) 

0.01073 

0.020 

11 

4.36697 

0.01211 

0.022 

12 

4.12012(3) 

0.01537 

0.028 

16 

3.65217 

0.02436 

0.032 

17 

3.43020(2) 

0.03045 

0.036 

19 

3.32203 

0.03399 

0.038 

20 

3.21569 

0.03788 

0.040 

21 

3.00826 

0.04691 

0.042 

22 

2.80759(6) 

0.05781 

0.054 

28 

2.70968(2) 

0.06406 

0.058 

30 

2.61333(6) 

0.07091 

0.070 


* Values in parentheses indicate frequency of occurrence. 


agreement, but the line with slope 1 through the points indicates that RSL is (with 
some exceptions) always slightly larger than NSL. Some indication of the discrepancy 
for small significance levels is given in Table 6.5. Some of the “large” discrepancies 
are, of course, due to the discreteness of RSL and the fact that the same i 7 -value may 
occur more than once. From a practical point of view, however, the agreement between 
NSL and RSL is quite remarkable. 

In the discussion above, the reader should keep in mind that this is only one example 
intended to illustrate a point, namely to support the theoretical result that, under certain 
conditions, the randomization distribution can be approximated by the F-distribution. 
This is, of course, only the beginning of what would have to be an extensive Monte 
Carlo study, using different values for t and r and for the responses y. Suffice it to say 
here that for a number of different values similar results were obtained giving plausible 
credence to the validity of our assertion (see also Kempthorne and Doerfler, 1969). 
Hence, from now on we shall use the F-test as an approximation to the randomization 
test for testing the hypothesis Hq : 丁丄 二丁 2 = … = 丁 t. 

We conclude this discussion by pointing out that in most cases the null hypothesis 
of the equality of treatment effects is not the most important hypothesis to test. For a 
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Figure 6.1 Plot of Significance Levels of Randomization Test vs. Approximating 
F-Test. 


well designed interested experiment the researcher is often more interested in specific 
treatment comparisons. Such questions will be discussed in more detail in Chapter 7. 
This does not diminish the importance of the ANOVA as a data analysis technique, as 
one of its useful functions is the estimation of the error variance component or as 
shown in Section 6.9 the error variance components (j 2 e and cr^. 

6.7 CRD WITH UNEQUAL NUMBERS OF 
REPLICATIONS 

Although in most applications of the CRD each treatment will be replicated the same 
number of times, namely r, it is not uncommon to have unequal numbers of replica¬ 
tions, say Vi for the zth treatment (z = 1,2,..., t). We may have, for example, the 
situation that one treatment, say treatment 1, is the control or standard with which we 
wish to compare the other treatments [see also Section 7.5.7]. It seems then reasonable 
to obtain especially good information about treatment L that is, to have more repli¬ 
cations for treatment 1 than for the other treatments. Or，it may be that among the t 
treatments some are more important than others, suggesting different numbers of repli¬ 
cations for the two sets of treatments. Another reason for having unequal r/s may 
simply be the fact that the observations from some EUs may be missing (independent 
of the treatments). 






180 


CHAPTER 6. COMPLETELY RANDOMIZED DESIGN 


6.7.1 Randomization 

Just as for the case of the equireplicate CRD, the randomization procedure can be per¬ 
formed by using random numbers (see Section 6.2.1). From a practical point of view ， 
the process can be implemented by using appropriate statistical software, e.g. SAS 
PROC PLAN (SAS Institute, Inc., 2002—2003). We illustrate this with the following 
example. 

Example 6.5: Suppose we consider a CRD with t = 4, r ： = 4, = 2. 

The SAS input statements and the output are given in Table 6.6. □ 


6.7.2 The Model and ANOVA 

It is obviously much more complicated to derive the randomization analysis for the 
situations described above. The basic model, however, is still model (6.22), 

Vij ~ + A + 

with i = 1,2.... ,t:j = 1,2,... ,rj. The ANOVA table for equal numbers of replica¬ 
tions (Table 6.2) is modified easily to accommodate unequal numbers of replications 
and is given in Table 6.7. 

Just as for the equal number case (Section 6 . 6 ) it can be illustrated through Monte 
Carlo studies that the randomization test for testing Hq : 丁 1 = 丁 2 = … = 丁 t can be 
approximated by the F -test 

" MS(T) 

— MS(E) 

with t — l and (Srj — t)d.f. 

6.7.3 Comparing Randomization Test and F-Test 

To illustrate the general agreement between the significance levels for both tests, that 
is, RSL and NSL, we give below the result for a simulation run for ^ = 4 and r! = r 2 = 
4, rs = r 4 = 2. In Figure 6.2 we show the relationship between RSL and NSL based 
on a random sample of 1000 randomizations, using the same procedure as described 
in Section 6 . 6 . Obviously, a much broader simulation study would have to be done to 
give more support to our claim that for the unequal replication CRD the randomization 
test can be approximated by the F-test, but a small number of simulations have led to 
results similar to those given in Figure 6.2. We found, in general, a good agreement 
between RSL and NSL for random samples of 1,000 randomizations as illustrated in 
Figure 6.2. Note also that these results are similar to those given in Figure 6.1. Obvi¬ 
ously when the experiment is small the approximation may not be good, but then the 
randomization test can be done easily on a computer. 

6.8 NUMBER OF REPLICATIONS 

One question that is being asked often, and it is an important question, is: How many 
replications are needed for each treatment? The reason for asking this question (which 
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Table 6.6 Randomization Procedure for CRD with UnequalReplications 

a. 〉 Input Statements : 

proc plan seed=13396; 
factors unit=10; 

treatments treat=10 cyclic (1111223344); 
output out=CRD; 

titlel r COMPLETELY RANDOMIZED DESIGN'; 
title2 'WITH UNEQUAL REPLICATIONS'; 
title3 r (t=4, rl=4 r2=r3=r4=2, N=10)'; 
run; 

proc sort out=CRD; 

by unit; 

run; 

proc print; 
run; 

b. ) Output : 

COMPLETELY RANDOMIZED DESIGN 
WITH UNEQUAL REPLICATIONS 
(t=4, rl=4 r2=r3=r4=2, N=10) 

The PLAN Procedure 

Plot Factors 

Factor Select Levels Order 

unit 10 10 Random 


Treatment Factors 


Factor Select Levels Order Initial Block / Increment 

treat 10 10 Cyclic (1111223344) / 1 

- unit - - treat - 

843 10 915762 1111223344 
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Table 6.6 (Continued) 

Obs unit treat 

112 

2 2 4 

3 3 1 

4 4 1 

5 5 3 

6 6 4 

7 7 3 

8 8 1 

9 9 2 

10 10 1 


Table 6.7 ANOVA for CRD with Unequal Numbers of Replications 

Source 

d.f. 

ss 

MS 


五 (MS) 

Treatments 

t - 1 

T ， r i(vi- - y ：) 2 

i 

Ms(r) 



Error 

E r i - t 

i 

T.(yu - Vi .) 2 

MS{E) 



Total 

En - 1 

i 

Yl(yij - y ..) 2 

Uj 





is often referred to — incorrectly — as a question of “sample size”）is to “assure” that 
the experiment is sensitive enough to detect differences among the treatments if there 
are any, that is, to reject the null hypothesis of no treatment differences in the ANOVA 
F-test. As it stands, however, the question cannot be answered without further input 
specific to the particular investigation at hand. 


6.8.1 Power of the F-Test 

Based on our discussion above we shall use the normal (Gaussian) independent error 
model and the associated central and noncentral F-distributions to examine this ques¬ 
tion, using the notion of the power of the F-test. More specifically, the sensitivity or 
power of the F-test, denoted by 1 — .5, where 8 is the probability of a Type II error, 
depends on 
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Figure 6.2 Plot of Significance Levels of Randomization Test (rsl) versus Approx¬ 
imating F-Test (nsl). 

(i) the size of the test, that is, probability of the Type I error, a; 

(ii) the degrees of freedom, t — 1 and t(r — 1); 

(iii) the noncentrality parameter 


A = 


i 

如 i 


(6.41) 


of the noncentral F-distribution, where the are the true values of the treatment effects 
as specified under the alternative hypothesis. The general procedure then is to specify 
a. 1 一 （3, and X/r and ask: How many replications, r, are needed to detect, with 
probability 1 — (3, treatment differences as specified by X/r if we use a test of size a? 

It is, of course, not difficult to specify a and 3. We usually take a = .05 or a = .10 
as those seem to be reasonable values for the risk of committing a Type I error, that is, 
concluding that there are differences among the treatments, when in fact there are none. 
A bit more difficult is the choice of (3 or rather 1-/J, the probability for concluding that 
there are differences among the treatments when they indeed exist. From a practical 
point a reasonable choice is 1 —= .80 although this choice, as all the others, depends 
on the particular problem under consideration. By far the most difficult choice is that 
of X/r, because that, after all, represents the true state of nature, something we do not 
know. How does one get out of that dilemma? 
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6.8.2 Smallest Detectable Difference 


It is at this point that the subject matter knowledge of the investigator becomes very 
important. Since we do not know the true state of nature, we might ask: What mini¬ 
mum difference between the two extreme treatments, the best and the worst，is worth 
detecting with probability of at least 1 — /3, if such a difference exists? This question 
is best visualized for the simple case t = 2. Suppose we want to compare two drugs ， 
an established drug A versus an experimental drug B expected to be better than A. 
How much greater has the therapeutic effect of B have to be before it is worth further 
development and marketing? There is obviously a minimum difference in therapeutic 
effects before B is worth developing, both from a medical as well as financial point 
of view. 

For the general case let us denote the difference between the largest treatment ef¬ 
fect, r max , and the smallest treatment effect, r min , by 

△ 二 『max — ^"min - (6.42) 


For any set of (i — 1,2. ..., satisfying (6.42) the smallest A/r is obtained when 
the remaining t — 2 treatment effects T{ are equal to (T max + r m i n )/2. Since the are 
defined such that = 0, this means that 


Tmax = 




It then follows from (6.41) that 


Tj = 0 otherwise. 



4 


△ 2 

4 ^ 2 * 


(6.43) 


(6.44) 


It is well known that the power of the F-test is an increasing function of A. Hence 
the power of the F-test with A given by (6.44) has the smallest value for all situations 
subject to (6.42). Tables and charts for the power of the F-test in terms of A (or suitable 
functions of A) and d.f. v\ = t — 1 and = t(r — 1) are available (Tang, 1938; Pearson 
and Hartley, 1970; Odeh and Fox, 1975) and can be used to obtain iteratively a suitable 
value for r, given a. 1 —/?,△ ， g\. For a description of this procedure we refer the 
reader to Scheffe (1959). 

A more convenient set of tables was developed specifically for the determination 
of the number of replications by Bowman and Kastenbaum (1975). They follow essen¬ 
tially the same arguments as given above but in terms of 




Tmax 一 ^min 


Tmax — ^min 


(6.45) 


the standardized minimum difference between the two extreme treatment effects (this 
is sometimes referred to as the effect size). It is quite often easier to specify rather 
than A, and specifying A* absolves one from also specifying Selected parts of the 
tables from Bowman and Kastenbaum (1975) are reproduced in Table 6.8 for t = 2, 
3, 4, 5, 6, 8, 9, 10, 11 ， 13, 15, 20, 25, 30, and a 二 .05, 1 - 3 = .7, .8, .9, and 
r = 2, 3, ...， 25. 
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6.8.3 Practical Considerations 

To show the use of these tables and the sometimes surprising results concerning the 
magnitude of r, we consider the following example. 

EXAMPLE 6.6: Suppose we have t = 3, a = .05. For different values of 1 — /3 = .7, 
.8, .9 and A* = 1.0, 1.5, 2.0, we obtain from Table 6.8 the following values for r given 
in Table 6.9 (since the exact values for A* are not represented in Table 6.8 we choose 
the next smaller value for A* that is represented in Table 6.8). □ 


Table 6.9 illustrates a number of points: 


(i) In some cases the number of replications seems surprisingly large. This often 
leads to disappointment when an experiment has been carried out without prior 
consideration of the number of replications. Generally too few replications are 
chosen resulting in a low power of the F-test which means that much effort may 
have been wasted. 

(ii) To detect small differences requires relatively more replications, in fact it may 
require more replications than is practical. It is therefore important to arrive at a 
realistic value for A*. 

(iii) As the probability 1 — of detecting an existing difference increases so does r, 
and in certain situations at an appreciable rate. 


An alternative to using Table 6.8 is to use the Power Procedure in SAS (SAS In¬ 
stitute, Inc., 2002-2003). To use this procedure, rather than specifying A* of (6.45) 
we need to specify the or // H- (i = 1, 2, .. t) under the alternative hypothesis 
(referred to as group means in the procedure) and the standard deviation <7 e . In order 
to obtain the same results as described above in connection with Table 6.8, we specify, 
for given the as in (6.43) with A = A* and a e = 1. We illustrate this procedure 
in the following example. 

Example 6.6 (continued): Using 亡 = 3, a = .05, /? = .9, and △* = 1， the SAS 
PROC POWER input is given in Table 6.10a and the output in Table 6.10b. The result 
is r = 27 (labeled N Per Group in the output), complementing the value (> 25) in 
Table 6.9. □ 

The conclusion one should draw from this discussion is that it is important to 
present the investigator with a table like Table 6.9 prior to the experiment to explain 
the options available. It is not so important to decide whether one needs 20 or 21 
replications, but rather that one needs about 20 and not 10 replications, for example. 

If the investigator is more comfortable in assigning a value to A rather than A*, we 
need, of course, some information about to determine r. Sometimes this information 
is available from previous similar experiments. In other cases one may have to do a 
preliminary study to estimate This estimate may be the MS(E) from the ANOVA 
table for the preliminary study or simply an estimate of the variance using just one 



186 


CHAPTER 6. COMPLETELY RANDOMIZED DESIGN 


Table 6.8 Values of A* to Determine Numbers of 
Replications CRD* 


t = A 




1-/3 



1-3 



1-/3 


r 

.7 

.8 

.9 

.7 

.8 

.9 

n 

.8 

.9 

2 

4.863 

5.653 

6.796 

4.883 

5.570 

6.548 

4.872 

5.504 

6.395 

3 

2.703 

3.071 

3.589 

2.957 

3.325 

3.838 

3.094 

3.460 

3.967 

4 

2.104 

2.381 

2.767 

2.335 

2.618 

3.010 

2.468 

2.754 

3.148 

5 

1.792 

2.024 

2.348 

1.997 

2.236 

2.568 

2.119 

2.362 

2.698 

6 

1.590 

1.796 

2.081 

1.775 

1.987 

2.280 

1.888 

2.104 

2.401 

7 

1.446 

1.632 

1.890 

1.615 

1.808 

2.073 

1.719 

1.916 

2.186 

8 

1.335 

1.507 

1.745 

1.492 

1.670 

1.915 

1.590 

1.771 

2.020 

9 

1,247 

1.407 

1.629 

1.394 

1.560 

1.788 

1.486 

1.655 

1.888 

10 

1.175 

1.325 

1.534 

1.313 

1.469 

1.684 

1.400 

1.559 

1.778 

11 

1.113 

1.256 

1.454 

1.245 

1.393 

1.596 

1.328 

1.479 

1.686 

12 

1.061 

1.197 

1.385 

1.186 

1.327 

1.521 

1.266 

1.409 

1.607 

13 

1.016 

1.145 

1.326 

1.135 

1.270 

1.456 

1.211 

1.349 

1.538 

14 

0.975 

1.100 

1.273 

1.090 

1.220 

1.398 

1.164 

1.296 

1.478 

15 

0.940 

1.060 

1.226 

1.050 

1.175 

1.347 

1.121 

1.249 

1.424 

16 

0.908 

1.024 

1.185 

1.015 

1.135 

1.301 

1.083 

1.206 

1.375 

17 

0.879 

0.991 

1.147 

0.982 

1.099 

1.259 

1.049 

1.168 

1.331 

18 

0.852 

0.961 

1.112 

0.953 

1.066 

1.222 

1.017 

1.133 

1.292 

19 

0.828 

0.934 

1.081 

0.926 

1.036 

1.187 

0.988 

1.101 

1.255 

20 

0.806 

0,909 

1.052 

0.901 

1.008 

1.155 

0.962 

1.071 

1.222 

21 

0.786 

0.886 

1.025 

0.878 

0.982 

1.126 

0.938 

1.044 

1.191 

22 

0.767 

0.865 

1.000 

0.857 

0.959 

1.099 

0.915 

1.019 

1.162 

23 

0.749 

0.845 

0.977 

0.837 

0.936 

1.073 

0.894 

0.996 

1.135 

24 

0.733 

0.826 

0.956 

0.819 

0.916 

1.050 

0.874 

0.974 

1.110 

25 

0.717 

0.809 

0.936 

0.802 

0.897 

1.028 

0.856 

0.953 

1.087 
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Table 6,8 (Continued) 

t = Q 




1-/3 



1-/3 



1-/3 


r 

.7 

.8 

.9 

.7 

.8 

.9 

.7 

.8 

.9 

2 

4.889 

5.490 

6.333 

4.922 

5.505 

6.317 

4.963 

5.534 

6.327 

3 

3.197 

3.562 

4.065 

3.283 

3.647 

4.149 

3.358 

3.723 

4,224 

4 

2.568 

2.856 

3.251 

2.650 

2.940 

3.337 

2.721 

3.013 

3.412 

5 

2.211 

2.457 

2.795 

2.287 

2.535 

2.876 

2.352 

2.602 

2.945 

6 

1,973 

2.191 

2.492 

2.042 

2.264 

2.567 

2.102 

2.326 

2.632 

7 

1.798 

1.997 

2.271 

1.863 

2.065 

2.341 

1.919 

2.123 

2.401 

8 

1.664 

1.848 

2.100 

1.725 

1.911 

2.166 

1.777 

1.965 

2.223 

9 

1.556 

1.728 

1.963 

1.613 

1.787 

2.026 

1.662 

1.839 

2.080 

10 

1.466 

1.628 

1.850 

1.521 

1.685 

1.910 

1.568 

1.734 

1.961 

11 

1.391 

1.544 

1.755 

1.443 

1.599 

1.812 

1.488 

1.645 

1.861 

12 

1.326 

1.472 

1.673 

1.376 

1.524 

1.727 

1.419 

1.569 

1,774 

13 

1.269 

1.409 

1.602 

1.317 

1.459 

1.654 

1.358 

1.502 

1.699 

14 

1.220 

1.354 

1.539 

1.266 

1.402 

1.589 

1.305 

1.444 

1.633 

15 

1.175 

1.305 

1.483 

1.220 

1.351 

1.531 

1.258 

1.391 

1.573 

16 

1.135 

1.261 

1.432 

1.178 

1.306 

1.479 

1.216 

1.344 

1.520 

17 

1.099 

1.221 

1.387 

1.141 

1.264 

1.433 

1.177 

1.302 

1.472 

18 

1.066 

1.184 

1.345 

1.107 

1.226 

1.390 

1.142 

1.263 

1.428 

19 

1.036 

1.151 

1.307 

1.076 

1.192 

1.351 

1.110 

1.228 

1.388 

20 

1.009 

1.120 

1.273 

1.047 

1.160 

1.315 

1.081 

1.195 

1.351 

21 

0.983 

1.092 

1.240 

1.021 

1.131 

1.282 

1.053 

1.165 

1.317 

22 

0.960 

1.065 

1.210 

0.996 

1.104 

1.251 

1.028 

1.137 

1.285 

23 

0.938 

1.041 

1.183 

0.973 

1.078 

1.222 

1.004 

Ull 

1.256 

24 

0.917 

1.018 

1.157 

0.952 

1.055 

1.195 

0.982 

1.086 

1.228 

25 

0.898 

0.997 

1.132 

0.932 

1.033 

1.170 

0.962 

1.064 

1.203 
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Table 6.8 {Continued) 




t = 8 



t = 9 



t = 10 




1-(3 



1 — 





r 

.7 

.8 

.9 

.7 

.8 

.9 

.7 

.8 

.9 

2 

5.009 

5.572 

6.350 

5.056 

5.613 

6.382 

5.104 

5.657 

6.419 

3 

3.426 

3.791 

4,293 

3.488 

3.854 

4.356 

3.545 

3.913 

4.416 

4 

2.784 

3.078 

3.479 

2.841 

3.136 

3.540 

2.893 

3.191 

3.596 

5 

2.409 

2.662 

3.008 

2.461 

2.716 

3.064 

2.509 

2.766 

3.116 

6 

2.155 

2.381 

2.689 

2.203 

2.431 

2.741 

2.247 

2.477 

2.789 

7 

1.968 

2.174 

2,455 

2.013 

2.221 

2.504 

2.054 

2.263 

2.548 

8 

1.823 

2.014 

2.274 

1.865 

2.057 

2.319 

1.903 

2.097 

2.361 

9 

1.706 

1.884 

2.128 

1,746 

1.926 

2.171 

1.782 

1.963 

2.210 

10 

1.609 

1.777 

2.006 

1.647 

1.816 

2.048 

1.681 

1.852 

2.085 

11 

1.527 

1.687 

1.904 

1.563 

1.724 

1.943 

1.596 

1.758 

1.979 

12 

1.457 

1.609 

1.816 

1.491 

1.644 

1.853 

1.522 

1.677 

1.888 

13 

1.395 

1.540 

1.739 

1.428 

1.575 

1.775 

1.458 

1.606 

1.808 

14 

1.340 

1.480 

1.671 

1.372 

1.513 

1.706 

1.401 

1.544 

1.738 

15 

1.292 

1.427 

1.611 

1.323 

1.459 

1.644 

1.351 

1.488 

1.675 

16 

1.248 

1.379 

1.556 

1.278 

1.410 

1.589 

1.305 

1.438 

1.619 

17 

1.209 

1.335 

1.507 

1.238 

1.365 

1.539 

1.264 

1.393 

1.568 

18 

1.173 

1.295 

1.462 

1.201 

1.325 

1.493 

1.227 

1.351 

1.521 

19 

1,140 

1.259 

1.421 

1.167 

1.288 

1.451 

1.192 

1.314 

1.479 

20 

1.110 

1.226 

1.384 

1.136 

1.253 

1.413 

1.161 

1.279 

1.440 

21 

1.082 

1.195 

1.349 

1.108 

1.222 

1.377 

1.131 

1.247 

1.403 

22 

1.056 

1.166 

1.316 

1.081 

1.193 

1.344 

1.104 

1.217 

1.370 

23 

1.032 

1.139 

1.286 

1.057 

1.165 

1.313 

1.079 

1.189 

1.338 

24 

1.009 

1.114 

1.258 

1.033 

1.140 

1.285 

1.056 

1.163 

1.309 

25 

0.988 

1.091 

1.232 

1.012 

1.116 

1.258 

1.033 

1.139 

1.282 






6.8. NUMBER OF REPLICATIONS 


189 


Table 6.8 {Continued) 




t = 11 

1-3 



亡 =13 
1-0 



t = 15 
1-/3 


r 

.7 

.8 

.9 

.7 

.8 

.9 

.7 

.8 

.9 

2 

5.152 

5.702 

6.458 

5.245 

5.792 

6.541 

5.334 

5.879 

6.625 

3 

3.599 

3.968 

4.472 

3.697 

4.069 

4.576 

3.785 

4.161 

4.670 

4 

2.942 

3.241 

3.649 

3.030 

3.333 

3.744 

3.109 

3.415 

3.830 

5 

2.553 

2.812 

3.164 

2.633 

2.895 

3.251 

2.705 

2.970 

3.329 

6 

2.288 

2.519 

2.834 

2.361 

2.596 

2.914 

2.426 

2.664 

2.986 

7 

2.091 

2.303 

2.590 

2.160 

2.374 

2.665 

2.220 

2.437 

2.732 

8 

1.939 

2.134 

2.400 

2.002 

2.201 

2.470 

2.059 

2.260 

2.533 

9 

1.815 

1.998 

2.247 

1.875 

2.061 

2.313 

1.929 

2.117 

2.372 

10 

1.713 

1.885 

2.120 

1.770 

1.945 

2.183 

1.820 

1.998 

2.239 

11 

1.626 

1.790 

2.012 

1.680 

1.847 

2.073 

1.728 

1.897 

2.126 

12 

1.551 

1.707 

1.920 

1.603 

1.762 

1.977 

1.649 

1.810 

2.029 

13 

1.486 

1.635 

1.839 

1.536 

1.688 

1.894 

1.580 

1.734 

1.944 

14 

1.428 

1.572 

1.767 

1.476 

1.622 

1.821 

1.519 

1.667 

1.868 

15 

1.376 

1.515 

1.704 

1.423 

1.564 

1.755 

1.464 

1.607 

1.801 

16 

1.330 

1.464 

1.646 

1.375 

1.512 

1.696 

1.415 

1.554 

L741 

17 

1.288 

1.418 

1.595 

1.332 

1.464 

1.643 

1.371 

1.505 

1.686 

18 

1.250 

1.376 

1.547 

1.293 

1.421 

1.594 

1.330 

1.460 

1.636 

19 

1.215 

1.338 

1.504 

1.257 

1.381 

1.550 

1.293 

1.420 

1.591 

20 

1.183 

1.302 

1.464 

1.223 

1.345 

1.509 

1.259 

1.382 

1.549 

21 

1.153 

1.270 

1.427 

1.193 

1.311 

1.471 

1.228 

1.348 

1.510 

22 

1.126 

1.239 

1.393 

1.164 

1.279 

1.436 

1.198 

1.315 

1474 

23 

1.100 

1.211 

1.361 

1.138 

1.250 

1.403 

1.171 

1.285 

1.440 

24 

1.076 

1.184 

1.332 

1.113 

1.223 

1.373 

1.145 

1.257 

1.409 

25 

1.053 

1.160 

1.304 

1.090 

1.197 

1.344 

1.122 

1.231 

1.379 
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Table 6.8 (Continued) 




t = 20 

1-/3 



t = 25 

1-3 



t = 30 
1-/3 


r 

.7 

.8 

.9 

.7 

.8 

.9 

.7 

.8 

.9 

2 

5.539 

6.086 

6.829 

5.722 

6.272 

7.018 

5.886 

6.441 

7.191 

3 

3.977 

4.359 

4.877 

4.138 

4.527 

5.053 

4.279 

4.674 

5.208 

4 

3.278 

3.592 

4.015 

3.419 

3.739 

4.171 

3.542 

3.868 

4.307 

5 

2.856 

3.129 

3.497 

2.983 

3.261 

3.637 

3.092 

3.376 

3.758 

6 

2.565 

2.810 

3.139 

1 2.681 

2.931 

3.268 

2.780 

3.036 

3.379 

7 

2.349 

2.572 

2.874 

2.455 

2.684 

2.993 

2.548 

2.781 

3.095 

8 

2.179 

2.386 

2.666 

2.279 

2.491 

2.777 

2.365 

2.582 

2.874 

9 

2.042 

2.236 

2.498 

2.136 

2.335 

2.603 

2.217 

2.420 

2.694 

10 

;1.928 

2.111 

2.359 

2.017 

2.205 

2.458 

2.094 

2.286 

2.544 

11 

1.831 

2.005 

2.240 

1.916 

2.094 

2.335 

1.989 

2.171 

2.417 

12 

1.747 

1.913 

2.138 ! 

1.829 

1.999 

2.228 . 

1.899 

2.073 

2.307 

13 

1.674 

1.833 

2.048 : 

1.752 

1.916 

2.135 

1.820 

1.986 

2.211 

14 

1.610 

1.763 

1.969 

1.685 

1.842 

2.053 

1.750 

1.910 

2.126 

15 

1.552 

1.700 

1.899 

1.625 

1.776 

1.980 

1.687 

1.842 

2.050 

16 

1.500 

1.643 

1.835 

1.571 

1.717 

1.914 

1.631 

1.781 

1.981 

17 

1.453 

1.591 

1.778 

1.521 

1.663 

1.854 

1.580 

1.725 

1.920 

18 

1.410 

1.544 

1.725 

1.477 

1.614 

1.799 

1.534 

1.674 

1.863 

19 

1.371 

1.502 

1.677 

1.436 

1.569 

1.749 

1.491 

1.628 

1.811 

20 ! 

1.335 

1.462 

1.633 

1.398 

1.528 

1.703 

1.452 

1.585 

1.764 

21 

1.302 

1.425 

1.592 

1.363 

1.490 

1.661 

1.416 

1.545 

1.720 

22 

1.271 

1.391 

1.554 

1.331 

1.454 

1.621 

1.382 

1.509 

1.679 

23 

1.242 

1.360 

1.519 

1.300 

1.421 

1.584 

1.351 

1.474 

1.641 

24 

1.215 

1.330 

1.486 

1.272 

1.390 

1.550 

1.321 

1.442 

1.605 

25 

1.189 

1.302 

1.455 

1.246 

1.361 

1.518 

1.294 

1.412 

1.572 


* Reproduced from K. O. Bowman and M. A. Kastenbaum, “Sample size requirement: Sin¬ 
gle and double classification experiments” in Selected Tables in Mathematical Statistics ， 
Vol. 3 (1975), by permission from the authors and the American Mathematical Society. 
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Table 6.9 Number of Replications in CRD 


1-/? 

A* 

1.0 

1.5 

2.0 

.7 

17 

8 

5 

•8 

21 

10 

6 

.9 

>25 

13 

8 


treatment, for example the control treatment. Although preliminary studies are usually 
rather small, they should be sufficiently large to get a reliable estimate of that is，an 
estimate based on a sufficient number of degrees of freedom. 


6.9 SUBSAMPLING IN A CRD 

As we have pointed out earlier (see also Section 2.3)，a careful distinction must be 
made between experimental units (EU) and observational (sampling) units (OU). Until 
now we have considered in this chapter the situation where EUs and OUs are identical. 
One consequence of this situation is that even though in the formulation of a linear 
model for observations from a CRD we distinguish between experimental error (Sij) 
and observational error (ry^), we cannot separate the two error terms in the analysis 
and hence we combine them usually into one error term (e^). There are, however, 
situations where EUs and OUs are not identical. We refer to the example in Section 2.3, 
where a class of students is the EU and the individual students are the OUs. This 
situation is generally referred to as a CRD with subsampling. 


6.9,1 Subsampling Model 

Suppose then we have t treatments, each replicated r' times, and each EU has n OUs, 
that is, n observations are obtained from each EU. An extension of model (6.19) can 
then be written as 

Vijk = f^ + Ti^r cij + rjij k (6.46) 

(i = 1, 2, ..., t:j = 1， 2, . • • ， r ’； A: = 1， 2, ... ， n) where represents the experimen¬ 
tal error and rjijk the observational error. According to our convention, we treat the e 勿 
as i.i.d. (0, cr?) and the rjijk as i.i.d. (0, a^). We note that 

var(y ijfc ) = 

just as before except that we can now separate the two variance components, or rather 
their estimates. This becomes obvious from the ANOVA table (see Table 6.11) asso¬ 
ciated with the linear model (6.46) which in linear model theory is referred to as a 
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Table 6.10 Determination of Number of Replications 


a. ) Input Statements : 
proc power; 

onewayanova test=overall 

groupmeans = -.5| 0 I .5 

stddev = 1 

npergroup =. 

power = .9 

alpha = .05; 

run; 

b. ) Output : 


The SAS System 

The POWER Procedure 
Overall F Test for One-Way ANOVA 

Fixed Scenario Elements 


Method 


Exact 

Alpha 


0.05 

Group Means 

-0 」 

3 0 0.5 

Standard Deviation 


1 

Nominal Power 


0.9 


Computed N Per Group 

Actual N Per 

Power Group 


0.908 


27 
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two-fold nested classification: the EUs are nested within the treatments and the OUs 
are nested within the EUs (see also Section 4.12 for the definition of a nested classifi¬ 
cation). The ANOVA table can be obtained easily from the following identity 

Vijk = 5 • • • + (Si.. — 5 …） + (Pij. ~ Vi.) + (Vijk ~ Vij .) 

mimicking model (6.46) and proceeding along the same lines as indicated in Sec¬ 
tion 6.4. 


6.9.2 Inferences with Subsampling 

It follows from Table 6.11 that in order to test the null hypothesis of no treatment 
differences we use the F-test (again, as an approximation to the randomization test) 


MS(T) 

MS(EE) 


(6.47) 


with t — 1 and t{r f — 1) d.f. Also, as pointed out earlier, and as is obvious from the 
五 (MS) in Table 6.11，the experimental and observational error variance components 
can be estimated separately, namely 

= MS(Oi；) (6.48) 

and 

= [MS(EE) - MS{OE)]/n. (6.49) 

Since the use of several observations per EU, that is ， subsampling, does not con¬ 
stitute replication of treatments, and since the d.f. for the F -test (6,47) are determined 
by t and 〆 and not by n, we may ask: What are the benefits that arise from subsam¬ 
pling? We have already pointed out one benefit，namely the separation of the estimates 
for experimental and observational error variance components. This allows us a closer 
look at our experimental and measurement techniques or rather the quality of these 
techniques expressed in terms of their variability. If we find, for example, that is 
unreasonably large, we may try to improve the reliability of our measurement process; 
or if (3^ is quite large, we may take another look at the EUs and their “homogeneity” 
and decide that we could reduce the experimental error by using supplementary infor¬ 
mation (see Chapter 8) or another design such as a randomized complete block design 
(see Chapter 9). Reduction of error, and by that we mean reduction of = cr? + 
is an important aspect of experimental design. 


6.9.3 Comparison of CRDs without and with Subsampling 

Another benefit of subsampling is that, even though it is not a substitute for replication, 
it may nevertheless lead to a reduction in the number of replications for the treatments, 
compared to a CRD without subsampling. 
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We have seen in Section 6.8 that the number of replications r required may be 
quite large, in fact larger than is possible for practical and economical purposes. We 
may ask: How can subsampling be of some help? Suppose we can choose between two 
situations: 


Plan I: CRD with r replications and no subsampling, that is, r l = r, n = 1. 

Plan II: CRD with r’ replications and subsampling of n > 1 OUs per EU with r f < r. 


In plan I the F-test is based on t(r — 1) d.f. in the denominator and the noncentrality 


parameter is 


rZrf 

2(<j2 + <J2) 


(6.50) 


whereas for plan II the F-test is based on t(r f — 1) d.f. in the denominator and the 
noncentrality parameter is 


r'nEr? .一 r'Erj 
2K + rwr|) - 2 ( 导 + 〜 2 ) ’ 


(6.51) 


Since the power of the F-test increases with the d.f. and the noncentrality parameter, 
plan II can be better than plan I only if An > Ai since t(r f — 1) < t(r — 1). Exactly 
what this relationship should be is hard to tell in general since this depends obviously 
on the values of r ， 〆 ， n，in a complex way. One way to look at this somewhat 
constructively is to compare var ( 仿 •. 一访 ， ••），that is, the variance of a simple treatment 
comparison, for both situations. Specifically, this variance for plans I and II is given by 

vari = 2(a\ + cr^)/r 
and 

varn = 2(cr^ + no^)/r’n 
r f n r' } 5 



(6.52) 


(6.53) 


respectively. One of the aims of experimental design is to reduce var(^.. - 仏 /•.) 
as much as possible. Expression (6.53) shows clearly that this cannot be done by 
increasing n alone; that reduces only one component and usually the less important 
one at that. We, therefore, have to consider both r' and n carefully in our choice of the 
design. A useful relationship between r, r\ n can be obtained by equating (6.52) and 
(6.53) and letting = bo\. We find then that 


rS 

— _ 

r’(l -f (5) — r 

or 

,_ r(5 + n) 

T n(6 + 1)' 

The way we may use these relationships is as follows: 


(6.54) 

(6.55) 
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(i) Based on an appropriate choice of A* [see (6.45)]，find r from the Bowman and 

Kastenbaum (1975) tables (see Table 6.8). We note that △* does not depend on 
the choice of the design, that is, the CRD without or with subsampling; in either 
case a e = +a^) 1/2 ; 

(ii) choose an r f in the neighborhood of r with r' < r; 

(iii) specify a value for S based on empirical or theoretical evidence; 

(iv) use (6.54) to determine an appropriate n, rounding up to integer values. 

We illustrate this procedure with the following example. 

Example 6.7: Suppose t = 5, a = .05, l ~ 3 = .80, A* = 1.50. From Table 6.8 
we find r = 12. For 6 = .50, .75, 1,00 the possible choices of r f and n are given in 
Table 6.12. □ 

The results of Table 6.12 show, that in general, 

(i) we only have a limited number of choices for r' ; 

(ii) as r f decreases, n increases rapidly; 

(iii) as S increases, more options for r’ are available; 

(iv) the total number of observations, tr f n ， for the CRD with subsampling is always 
considerably larger than the total numbers of observations, tr, for the CRD with¬ 
out subsampling. 

The important point of this whole discussion is that we must carefully evaluate 
our options before embarking on an experiment, taking the investigator’s aims, the 
availability of experimental material, and limitation of resources into account. Only 
then can we avoid major disasters at the end of the experiment. 

6.10 TRANSFORMATIONS 

An important aspect of the analysis of experimental data is that of the scale of measure¬ 
ment. Problems can arise because of reasons mentioned below, namely nonadditivity 
of unit and treatment effects and nonconstancy of variances. Both phenomena are pos¬ 
sibly related and may be resolved by transformation of the data to a more appropriate 
scale. 


6.10.1 Nonadditivity in the General Sense 

The reader will have noted that throughout we have made critical use of the idea of 
additivity of treatment contribution and unit contribution [see (6.3) and (6.16)]. This 
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Table 6.12 Numbers of Replications and Size of Subsamples for 

r = 12 


(5 = 

=.50 

5 = 

=.75 

5 = 

: 1.00 

r / 

n 

〆 

n 

r 1 

n 

11 

2 

11 

2 

11 

2 

10 

2 

10 

2 

10 

2 

9 

4 

9 

3 

9 

2 



8 

5 

8 

3 



7 

36 

7 

6 


in essence amounts to a choice of scale of measurement. For example，if we have unit- 
treatment additivity for \ZTik, that is, y/Tik = Ti + Uk, then we obviously do not have 
additivity for 

T ik = 7f + 2TiU k + Ul 

There has been extreme negligence with regard to this aspect of experimental inference. 
The problem was addressed by Neyman et al. (1935) and McCarthy (1937), but never 
by Fisher, which led to angry disagreements between Neyman and Fisher in the 1930s. 
Kempthorne (1952, Chapter 8) addressed this problem, but mainly in the context of 
the randomized complete block design. It is intrinsic in analyses of experimental data 
that additivity holds; otherwise different experiments will lead to the existence of in¬ 
teraction between experiments and treatments. It is, of course, not possible to establish 
for which scale of measurement additivity holds, but it is plausible to associate nonad¬ 
ditivity with nonconstancy of variances, and that can often be removed by a suitable 
transformation of the observations. 


6.10.2 Nonconstancy of Variances 


An essential aspect of the analysis of data from a CRD using model (6.19) or (6.46) is 
the constancy of variance of the observations. In general, this is not an unreasonable 
assumption. There exist, however, situations where this assumption is clearly not true. 
For example, in an experiment to compare different nutrients with respect to germi¬ 
nation rate of certain seeds we have n seeds in each pot (EU) and for each seed the 
observation is y = 1 if the seed germinates and = 0 if it does not germinate. If for 
the zth treatment the probability (rate) for germination is Pi, then obviously 


Eijjijk) = Pi 
var(t/2jfc) — 一 Pi) 
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and 

= Pi 

— 、 以1 — 巧) 
var (㈣ .）= --- • 

Hence if there are differences among the treatments, then we do not have constancy 
of variances. This is an example where the variance is a function of the mean of the 
observations. More generally we can express this as follows. If y is the observation 
with E(y) = (j,*, we then write 

vav(y) = (6.56) 

where g(ix*) is some function of /i* (we note here that in our case /i* = M ^ if 
the observation y is obtained for the ith treatment and that for this reason the g(^) in 
(6.56) are possibly different). To equalize or stabilize the variance across treatments 
we use a transformation of the observations y, say f(y), such that 

var[/(y)] = constant = c 2 ， say. (6.57) 

6.10.3 Choice of Transformation 

To determine a suitable transformation we use the Taylor series expansion of f(y) 
around fj,* (in the statistical literature this is also referred to as the method of statis¬ 
tical differentials or the delta method) and write 

f(y) = /("*)+ /(〆 ）（ "_〆）+ remainder (6.58) 

Taking the expected value of both sides of (6.58) gives 

E[f{v)\ = /(〆 ） 

and hence, using (6.56) and (6.58), 

var[/(y)] S [/’(〆)]V(〆). （ 6.59) 

It follows then from (6.57) and (6.59) that 

齡^ m 齡/命"' (6 . 6 °) 

For observations for which (6.56) holds (at least approximately) because of theoreti¬ 
cal considerations or because of empirical evidence (by plotting the residuals, for ex¬ 
ample), we can then determine an appropriate transformation f(y) from (6.60). Ex¬ 
amples of some well-known transformations (see Bartlett, 1947; Kempthorne, 1952, 
Section 8.5) are given in Table 6.13. 
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It is of course, obvious from (6.58) and (6.59) that the transformations given in 
Table 6.13 will not achieve complete constancy of variance. They serve, however, as 
basic transformations and are quite satisfactory from a practical point of view even if 
the form of g{^) is not entirely correct. Modifications to some of the transformations 
in Table 6.13 have been proposed to enhance their performance, that is, come closer to 
achieving constancy. Freeman and Tukey (1950) proposed to replace transformation 1 
in Table 6.13 for np* ^ 1 by 

f{y) = Vy+ + 1 

with var[/(y)] = 1, and transformation 4 for /i* > 1 by 


f(v) = o 


arcsin A 


y 


n + 1 


+ arcsinA 


/y + 1 

n + 1 


6.10.4 Power Transformations 


For situations where there does not necessarily exist a relationship between the mean 
and the variance as discussed in the previous section, Box and Cox (1964) have pro¬ 
posed a parametric family of transformations: 


f{y )= 


vW = iy x - i)/^ 

y(0) = logy 


(A^O) 


(6.61) 


Because of the form of the transformations (6.61) they are also referred to as power 
transformations. The general idea here is to estimate A from the data and then use 
y(X) as the actual transformation, where A is the estimate of A. Since the scale of the 
transformed observation depends on A, that is, A, Box and Cox (1964) suggested to use 
the normalized transformation 


之 (A)=(〆- 1 胸入 _ W 0) 

^(0) = 2 /logy 

instead, where y is the geometric mean of the observations y. 

Since the primary objective of the transformations (6.61) is to achieve normality the 
estimator for A is obtained by assuming a multivariate normal distribution for y ( 入 ） with 
constant variance, the secondary objective, for a “simple” linear model, the tertiary 
objective. With these assumptions, that is, objectives, A can then be obtained using 
the theory of maximum likelihood. Also, approximate confidence limits for A can be 
obtained giving the user a wider choice for A which may be helpful in interpreting 
the transformation, that is, rather than using, for example, A = —.9 a more suitable, 
and perhaps plausible, choice may be to use A = —1. As normality is not a crucial 
assumption in our discussions, we shall not pursue this topic further but refer the reader 
to the Box and Cox (1964) results. 
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6.11 EXAMPLES USING SAS® 

The aim of this chapter is to explain and describe the nature, philosophy, and properties 
of the CRD together with the underlying linear model and the associated analysis. The 
reader will have noticed that we did not pay any attention to purely computational is¬ 
sues. The reason for this is, of course, that statistical packages are available to perform 
the analysis conveniently and without any difficulty. We shall not mention the various 
computer packages available (their number rises almost daily) as every user has his or 
her favorite. Instead we shall present some examples and make some comments about 
the analysis using the Statistical Analysis System (SAS) (SAS Institute, Inc., 2002- 
2003) as a representative of many suitable programs. It is assumed that the reader has 
some familiarity with the fundamentals of SAS. 

Example 6.8: Consider an experiment to compare the efficiency of five different 
types of gasoline, measured in miles/gallon. Ten cars, all of the same make and model, 
are available. Each car is randomly assigned a particular type of gasoline such that 
each type is assigned to two cars. Each car is driven a specified route and the gas 
consumption, converted to miles/gallon, is recorded. 

This is a CRD with t = 5 treatments and r = 2 replications. Each car is the EU 
and OU. 

The analysis is performed using SAS PROC GLM. The input statements are given 
in Table 6.14a and the output is given in Table 6.14b. 

We comment briefly on some aspects of the output: 

(i) It is useful to have the data set printed out, since it provides an easy and con¬ 
venient way to detect obvious typographical errors that may have occurred in 
the data recording and/or input (we note here that in the following we may not 
always adhere to this suggestion in order to conserve space). 

(ii) The ANOVA table is of the form given in Table 6.2. For the CRD the Model SS 
and the Brand Type I and Type III SS are identical. 

(iii) The P-value of 0.0049 indicates that there are differences among the brands of 
gasoline. 

(iv) The brand least squares means are identical to the means. The _P-values given 
here are of no interest as they test H 0i : /i + ^ = 0(i = 1 ， 2, . ， . ， 5). 

□ 

Example 6.9: As an extension of the experiment described in Example 6.8 we con¬ 
sider now the situation where each car is driven three times and a measurement is 
obtained after each drive. We thus have a CRD with subsampling, the drives represent¬ 
ing the subsamples. More precisely we then have t = 5, r' — 2, n = 3. The data 
are provided in Table 6.15a together with the SAS input statements using, mainly for 
purposes of comparison, SAS PROC GLM and SAS PROC MIXED. The output for 
both procedures is given in Table 6.15b. 
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Table 6.14 Data Analysis for CRD 


a.) Input Statements : 

data crdgas ； 

input brand miles ; 

datalines; 

1 25.8 1 23.9 

2 28.5 2 27.9 

3 22.3 3 24.0 

4 29.5 4 28.7 

5 26.0 5 25.8 

r 

run; 

proc print data=crdgas; 

titlel ， DATA FOR CRD (t=5, r=2) f ; 

run; 


proc glm data=crdgas; 
class brand; 
model miles=brand; 
lsmeans brand/stderr; 
titlel 'ANALYSIS OF CRD"; 
run; 


b.) Output : 


DATA FOR CRD (t=5, r=2) 
Obs brand miles 


1 1 25.8 

2 1 23.9 

3 2 28.5 

4 2 27.9 

5 3 22.3 

6 3 24.0 

7 4 29.5 

8 4 28.7 

9 5 26.0 

10 5 25.8 
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Table 6.14 {Continued) 





ANALYSIS 

OF CRD 





The GLM Procedure 





Class Level 

Information 




Class Levels Values 




brand 

5 12 3 4 

5 



Number of Observations Read 

10 



Number of Observations Used 



Dependent 

Variable : miles 






Sum of 




Source 

DF 

Squares 

Mean Square 

F Value 

Pr > F 

Model 

4 

47.23400000 

11.80850000 

15.66 

0.0049 

Error 

5 

3.77000000 

0.75400000 



Corrected 

Total 9 

51.00400000 





R-Square 

Coeff Var 

Root MSE 

miles Mean 



0.926084 

3.309191 

0.868332 

26.24000 


Source 

DF 

Type 工 SS 

Mean Square 

F Value 

Pr > F 

brand 

4 

47.23400000 

11.80850000 

15.66 

0.0049 

Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

brand 

4 

47.23400000 

11.80850000 

15.66 

0.0049 



Least Squares Means 






Standard 




brand 

miles LSMEAN 

Error 

Pr > |t| 



1 

24.8500000 

0.6140033 

<•0001 



2 

28.2000000 

0.6140033 

<.0001 



3 

23.1500000 

0.6140033 

〆 A rv -1 
^ \j 1. 



4 

29.1000000 

0.6140033 

<•0001 



5 

25.9000000 

0.6140033 

<.0001 
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We make the following comments concerning the output: 

For PROC GLM: 

(i) The model SS contains both SS(brand) and SS(car within brand), the latter rep¬ 
resenting -in a technical sense - the experimental error; 

(ii) both sums of squares in (i) are separated in the Type I and Type III ANOVAs; 

(iii) the F-value and the P-value for brand under Type III AN OVA are incorrect and 
should be disregarded (see (iv) below); 

(iv) the correct F- and P-values for brand are provided after specifying in the in¬ 
put statement that, according to Table 6.11, F=MS(brand)/MS(car (brand))= 
33.13/1.36 二 24.48 with P = .0018; 

(v) a statement has to be provided in the input to obtain the correct standard error for 
the least squares means such that, just as in (iv) above, MS(car(brand)) should 
be used as the appropriate MS (experimental error). 

For PROC MIXED: 

(vi) In the input statement the source car(brand) is recognized as the experimental 
error by declaring the effect as a random effect; 

(vii) as a consequence of the statement in (vi) above estimates for and will be 
obtained. The estimation method used is, by default, REML (REsidual Max¬ 
imum Likelihood) (for details about REML see Section II. 1.11.2). We obtain 
<jg = .39 and = .19. For balanced data the REML estimates are the same as 
those given in (6.49) and (6.48), respectively; 

(viii) the test for no differences among brands is now performed correctly without 
further input. The same holds for the standard errors of the least squares means; 

(ix) the above comments show that PROC MIXED appears to be the preferred method. 
Only if we like to obtain the ANOVA table as given in Table 6.11 should we use 
PROC GLM. □ 


6.12 EXERCISES 

6.1 Consider the following results from a CRD with t = 2 treatments and r = 4 
replications for each treatment: 

EU# 1 2 3 4 5 6 7 8 

Response 7 2 1 6 4 8 10 4 

Treatment 2 1 1 1 2 2 2 1 

(i) Using the ratio MS(T)/MS(E) as the test criterion perform the randomiza¬ 
tion test for Ho : Ti = 丁 2 . 
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Table 6.15 Data Analysis for CRD with Subsampling 


a.) Input Statements : 

data mileage; 

input brand car miles 

datalines; 

11 25.8 1 1 25.6 1 1 26.0 1 2 23.9 1 2 24.2 1 2 23.5 

21 28.5 2 1 28.0 2 1 28.4 2 2 27.9 2 2 28.1 2 2 28.4 

31 22.3 3 1 22.7 3 1 23.0 3 2 24.0 3 2 23.1 3 2 23.5 

41 29.5 4 1 27.5 4 1 29.1 4 2 28.7 4 2 29.0 4 2 28.8 

51 26.0 5 1 25.7 5 1 26.1 5 2 25.8 5 2 25.6 5 2 26.3 

) 

run; 


proc glm data=mileage; 
class brand car; 
model miles=brand car(brand); 
test H=brand E=car(brand); 
lsmeans brand/stderr E=car(brand); 


titlel A ANALYSIS OF CRD W/SUBSAMPLING^; 

title2 f USING PROC GLM"; 

run; 

proc mixed data=mileage; 

class brand car; 

model miles=brand; 

random car(brand); 

lsmeans brand; 

title2 'USING PROC MIXED"; 

run; 


b.) Output : 


ANALYSIS OF CRD W/SUBSAMPLING 
USING PROC GLM 

The GLM Procedure 

Class Level Information 

Class Levels Values 

brand 5 12345 

car 2 12 


Number of Observations Read 
Number of Observations Used 


30 

30 
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Table 6.15 (Continued) 


Dependent Variable : miles 

Sum of 


Source 

DF 

Squares 

Mean Square 

F 

Value 

Pr > F 

Model 

9 

140.0466667 

15.5607407 


80.21 

<.0001 

Error 

20 

3.8800000 

0.1940000 




Corrected 

Total 

29 143. 

9266667 





R-Square 

Coeff Var 

Root MSE 


miles Mean 


0.973042 

1.683265 

0.440454 


26.16667 

Source 

DF 

Type I SS 

Mean Square 

F 

Value 

Pr > F 

brand 

4 

133.2433333 

33.3108333 


171.71 

<•0001 

car(brand) 

5 

6.8033333 

1.3606667 


7.01 

0,0006 

Source 

DF 

Type III SS 

Mean Square 

F 

Value 

Pr > F 

brand 

4 

133.2433333 

33.3108333 


171.71 

<•0001 

car(brand) 

5 

6.8033333 

1.3606667 


7.01 

0.0006 

Tests of Hypotheses Using the Type III 

MS for car(brand) as an 

Error Term 

Source 

DF 

Type III SS 

Mean Square 

F 

Value 

Pr > F 

brand 

4 

133.2433333 

33.3108333 


24.48 

0.0018 



Least Squares Means 





Standard Errors and Probabilities Calculated Using the Type 工工工 MS for 

car(brand) as an Error Term 


brand 

miles LSMEAN 

Standard 

Error 

Pr > 11 | 

1 

24.8333333 

0.4762119 

<.0001 

2 

28.2166667 

0.4762119 

<.0001 

3 

23.1000000 

0.4762119 

<.0001 

4 

28.7666667 

0.4762119 

<.0001 

5 

25.9166667 

0.4762119 

<•0001 
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Table 6.15 (Continued) 


ANALYSIS OF CRD W/SUBSAMPLING 
USING PROC MIXED 


The Mixed Procedure 


Model Information 


Data Set 

Dependent Variable 
Covariance Structure 
Estimation Method 
Residual Variance Method 
Fixed Effects SE Method 
Degrees of Freedom Method 


WORK.MILEAGE 
miles 

Variance Components 

REML 

Profile 

Model-Based 

Containment 


Class Level Information 


Class Levels Values 


brand 5 12345 

car 2 12 


Iteration History 

Iteration Evaluations -2 Res Log Like 

0 1 58.65095075 

1 1 48.64765549 

Convergence criteria met. 

Covariance Parameter 
Estimates 


Criterion 

0.00000000 


Cov Parm 


Estimate 


car(brand) 0.3889 

Residual 0.1940 


Fit Statistics 


-2 Res Log Likelihood 48.6 
AIC (smaller is better) 52.6 
AICC (smaller is better) 53.2 
BIC (smaller is better) 53.3 
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Table 6.15 (Continued) 


Type 3 Tests of Fixed Effects 



Num 

Den 



Effect 

DF 

DF 

F Value 

Pr > F 

brand 

4 

5 

24.48 

0.0018 


Least Squares Means 
Standard 


Effect 

brand 

Estimate 

Error 

DF 

t Value 

Pr > |ti 

brand 

1 

24.8333 

0.4762 

5 

52.15 

<•0001 

brand 

2 

28.2167 

0.4762 

5 

59.25 

<•0001 

brand 

3 

23.1000 

0.4762 

5 

48.51 

<.0001 

brand 

4 

28.7667 

0.4762 

5 

60.41 

<.0001 

brand 

5 

25.9167 

0.4762 

5 

54.42 

<.0001 


(ii) Compare the significance level achieved in (i) to that of the usual F-test. 


6.2 A researcher has done a preliminary study in the form of a CRD with subsam¬ 
pling to help him decide on the final design. He wants to compare five (5) treat¬ 
ments. In the preliminary study he has used two (2) experimental units for each 
treatment and two (2) observations per experimental unit. 

The partial ANOVA for the data from the preliminary study is given below: 

Source SS 

Treatments 43.58 

Expt. Error 55.00 

Sampling Error 30.00 

and the 5 treatment means are 10.00, 12.30, 11.80, 14.25, 13.56. 

In the final experiment the researcher wants to compare the same 5 treatments. 
He wants to use a CRD with or without subsampling. 

(i) Suppose he wants to detect, with probability .9, approximately the same 
difference between the best and worst treatment as observed in the study, 
using a test of size .05. Based on the information available from the pre¬ 
liminary study, how many replications does he need for a CRD without 
subsampling? 
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(ii) Suppose the testing of hypotheses is not so important. He would like to 
consider possible CRDs with subsampling that achieve a variance of treat¬ 
ment comparisons no larger than the variance for the CRD obtained in (i). 
Give possible options of designs. 

6.3 An agronomist conducted a field trial to compare the relative effects of five par¬ 
ticular fertilizers on the yield of Trebi barley. Thirty homogeneous experimental 
plots are available and six were assigned at random to each fertilizer treatment. 
At harvest time, three sample quadrats were taken (at random) from each exper¬ 
imental plot and the yield was obtained for each of the 90 quadrats. 

(i) What is the name of the experimental design used? Give an appropriate 
model for analyzing the data from this experiment. 

(ii) The agronomist has consulted two “statisticians” for the analysis of the 
data. He wants to know whether there are differences among the fertilizer 
effects. He is confused by the three SAS printouts (see Tables 6.16 a, b, c) 
and he comes to you for help to find out which analysis is correct. Based on 
the information provided explain how an appropriate test should be carried 
out. 

(iii) What is the variance of a single observation [express as a formula based 
on your model statement in (i)] and how much of this variance is due to 
experimental error and to observational (sampling) error (use the numerical 
information provided). 

(iy) Give the standard error for a simple treatment comparison. 

6.4 A pharmaceutical company conducts an experiment to compare 5 drugs. 30 
animals are available for the trial. Each drug is injected into 6 randomly selected 
animals. All the animals are very similar. After an appropriate period of time 2 
blood samples are taken from each animal and duplicate analyses are made for 
each blood sample. The reading from each analysis represents the observation to 
be used for the statistical analysis of this experiment. 

(i) What kind of experimental design has been used? 

(ii) What feature does this design have which we have not encountered before? 

(iii) How many types of errors can you identify for this design? Give their 
names. 

(iv) Write out an appropriate model for this design. 

(v) Based upon this model, outline the AN OVA table, giving sources of varia¬ 
tion and degrees of freedom. 

(vi) Using the ANOVA table, indicate how you would test the hypothesis that 
there are no differences among the drugs. 
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6.5 For the CRD with t treatments, r 7 replications per treatment and n observations 
per EU, show that 


var 




r'n 


Table 6.16 SAS Inputs and Outputs for Exercise 6.3 


a.) SAS Code : 

proc~glm; 
class ~ fert; 
model y = fert; 
run; 

SAS Printout (3A) : 

General Linear Models Procedure 
Class Level Information 

Class Levels Values 

FERT 5 12345 

Number of observations in data set = 90 

General Linear Models Procedure 

Dependent Variable:~Y 

Sum of Mean 


Source 

DF 

Squares 

Square 

F Value 

?r > F 

Model 

4 

65240.711 

16310.178 

242.72 

0.0001 

Error 

85 

5711.778 

67.197 



Corrected 

Total 89 

70952.489 





R-Square 

C.V. ] 

Root MSE 

Y Mean 


0.919499 

10.20988 

8.1974 


80.289 

Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

FERT 

4 

65240.711 

16310.178 

242,72 

0.0001 

Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

FERT 

4 

65240.711 

16310.178 

242.72 

0.0001 
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Table 6.16 {Continued) 


b.) SAS Code : 

proc glm; 
classes fert rep; 
model y = fert rep(fert); 
run; 

SAS Printout : 

General Linear Models Procedure 
Class Level Information 


Class 

Levels 

Values 


FERT 

5 

1 

2 

3 4 

5 

REP 

6 

1 

2 

3 4 

5 6 



Number of observations in 

data set = 

90 



General 

Linear Models 

Procedure 



Dependent 

Variable: Y 







Sum of 

Mean 



Source 

DF 

Squares 

Square 

F Value 

Pr > F 

Model 

29 

67139.822 

2315.166 

36.43 

0.0001 

Error 

60 

3812.667 

63.544 



Corrected 

Total 89 

70952.489 





R-Square 

C.V. 

Root MSE 


Y Mean 


0.946265 

9.92B493 

7.9715 


80.289 

Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

FERT 

4 

65240.711 

16310.178 

256.67 

0.0001 

REP (FERT) 

25 

1899.Ill 

75.964 

1.20 

0.2811 

Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

FERT 

4 

65240.711 

16310.178 

256.67 

0.0001 

REP (FERT) 

25 

1899.111 

75.964 

1.20 

0.2811 


c.) SAS Code : 

proc glm; 

classes fert rep; 
model y = fert rep(fert); 
test h=fert e=rep(fert); 
run; 
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Table 6.16 (Continued) 


SAS Printout : 


General Linear Models Procedure 
Class Level Information 
Class Levels Values 

FERT 5 12345 

REP 6 123456 

Number of observations in data set = 90 


General Linear Models Procedure 


Dependent 

Variable : Y 

Sum of 

Mean 



Source 

DF 

Squares 

Square 

F Value 

Pr > F 

Model 

29 

6713S.822 

2315.166 

36.43 

0.0001 

Error 

60 

3812.667 

63.544 



Corrected 

Total 89 

70952.489 





R-Square 

C.V. 

Root MSE 


Y Mean 


0.946265 

9.928493 

7.9715 


80,289 

Source 

DF 

Type 工 SS Mean Square 

F Value 

Pr > F 

FERT 

4 

65240.711 

16310.178 

256.67 

0.0001 

REP (FERT) 25 

1899.111 

75.964 

1.20 

0.2811 


Source DF Type III SS Mean Square F Value Pr > F 

FERT 4 65240.711 16310.178 256.67 0.0001 

REP (FERT) 25 1899.111 75.964 1.20 0.2811 

Tests of Hypotheses using the Type III MS for REP (FERT) as an error 
term 

Source DF Type III SS Mean Square F Value Pr > F 

FERT 4 65240.711 16310.178 214.71 0.0001 



CHAPTER 7 


Comparisons of Treatments 


7.1 INTRODUCTION 

We have mentioned earlier that the aim of any experiment is to compare treatments. 
One way to do this is in the context of the ANOVA by testing the hypothesis Hq: Ti = 
r 2 = •• = rt = 0. In most situations, however, the result from such a test is not very 
informative. To arrive at the conclusion that there are differences among the treatments 
(typically associated with a small P-value) is generally no major surprise. The immedi¬ 
ate question then is: which treatments are different from each other, or how large is the 
difference between certain treatment effects, or is there a systematic pattern describing 
the magnitudes of treatment effects? As so often, the type of question(s) depends on 
the particular experiment and the kind of treatments used in the experiment. 

It is obviously impossible to anticipate all conceivable questions. There are, how¬ 
ever, certain kinds of comparisons that can be grouped into a number of categories 
based on the statistical methodology used to analyze such comparisons. We shall give 
a brief discussion of these methods in the context of the CRD, but the reader should 
have no difficulty adapting these methods to other designs, as described in later chap¬ 
ters, as well. 


7.2 PREPLANNED COMPARISONS FOR 
QUALITATIVE TREATMENTS 

In many experiments the t treatments under investigation bear some relationship to 
each other which immediately suggests that certain comparisons are of more interest 
than others. These comparisons are indeed often the basis for the experiment. They are 
referred to as a priori comparisons or preplanned comparisons. 
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7.2.1 Treatment Contrasts 

As an example consider the following experiment. We want to compare the effective¬ 
ness of two different pesticides, A and B say, both applied in two different forms: 
spray (Ai and B\) and powder (A2 and B2). A control (that is, no pesticide), C say, 
is included in the experiment in order to establish any effectiveness of the pesticides at 
all. Thus we have altogether t = 5 treatments, C, Bi, B 2 , and each treatment 

is applied randomly to r more or less uniformly infested plots of land. The aim of the 
experiment and the “structure” of the treatments suggest the following comparisons: 

(i) control vs. pesticides, 

(ii) pesticide A vs. pesticide B, 

(iii) application Ai vs. A2, 

(iv) application B\ vs. B 2 , 

(v) spray vs, powder, 

(vi) A\ vs. 

(vii) A2 vs. B2. 

Using model (6.22) for the observations from this experiment, we can express any 
of the comparisons above in terms of the treatment effects as 

5 

Q = ^2 c ^ Ti ( 7 . 1 ) 

i=l 

(l = 1,2,...,7)，where the cu are constants such that = 0 for every l and the T{ 
represent the treatment effects as indicated below: 

i 1 2 3 4 5 

Treatment C A\ A2 B\ B2 


7.2.2 Orthogonal Contrasts 

The coefficients cu in (7.1) for the contrasts (i)-(vii) above are given in Table 7.1. A 
closer look at the coefficients for C 1; C2, C3, C 4 reveals that 

^ cuci'i — 0 (7.2) 

i 

for /, l f = 1.2, 3,4 and l ^ V. Any two contrasts, C\ and C"，for which (7.2) is 
satisfied are called orthogonal contrasts. In this example then Ci, C 2 , C3, and C 4 are 
orthogonal contrasts and so are Ci, C 5 , Cq, and C 7 . Each of these two sets of contrasts 
are referred to as a complete set of orthogonal contrasts for the five treatments. 
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Table 7.1 Coefficients for Contrasts 


Contrast 



i 



Divisor 

1 

2 

3 

4 

5 

Ch 

4 

-1 

-1 

-1 

-1 

\/20 

c 2 

0 

1 

1 

-1 

-1 

2 

c 3 

0 

1 

-1 

0 

0 


c 4 

0 

0 

0 

1 

-1 

V2 

c 5 

0 

1 

-1 

1 

-1 

2 

c 6 

0 

1 

0 

-1 

0 

^2 

c 7 

0 

0 

1 

0 

-1 

V2 


In general, fort treatments there exist many (actually infinitely many) complete sets 
of t — 1 orthogonal contrasts each, but only few (if any) are useful for interpreting the 
results of the experiment. These are usually suggested by the structure of the treatments 
as in the example described above or as determined by the factorial structure of the 
treatments (see Chapter 11 and Chapter II.7). 


7.2.3 Partitioning the Treatment Sum of Squares 


A special feature of orthogonal contrasts is that they can be incorporated easily in the 
ANOVA in the sense that the sums of squares for the individual contrasts in a complete 
set provide a partitioning of the treatment sum of squares, SS(T) in Table 6.1，into t — 1 
single d.f. sums of squares. To show this, consider the contrast C\ as given in (7.1). An 
estimator for C\ is obviously 

Ci = - ( 7 . 3 ) 


var(Ci) = 

i 

The sum of squares associated with C\ is then given by 


SS(Q)= 


IQ1 2 

var(C/)/^| 


E 


Clil/i. 




(7.4) 


We assume, without loss of generality, that = 1 for / = 1,2,. ■. 彳 一 1. We 
refer to such comparisons as standardized contrasts. These can be obtained by replac¬ 
ing the coefficient cu in (7.3) by q* = c u/ c w ， f° r every i = 1 ， 2, .. •，We 
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then have Ei(c*-) 2 = 1 . For example, for the contrast coefficients in Table 7.1 the 
standardizing divisors are given in the last column of Table 7,1. For ease of notation 
we drop in the following the asterisk in c 匕 . 


Then (7.4) reduces to 


or, if we write 


and 


it follows that 


To show that 


SS(Ci) = r cuyj, 

.i . 

c ； = (cu,c ； 2, - - - , C;t) 

f = (U 2 .，… ，汍 .) 

SS(C ; ) = r[c ； y] 2 . 

t-l 

SS(T) = ^SS(C,) 


(7.5) 

(7.6) 


we consider the following set of t orthogonal linear functions of y expressed in matrix 
notation as 

( c ?\ 


y = C'y say 


(7.7) 




that is, 


with 


zi = c；y (Z = 0, 1，… ，t - 1) 


c '^Vt 3 ' 

and J being a vector of t unity elements. It follows then that C / is an orthogonal matrix, 
that is, 

CC = 1 


and hence 

C = (C , )- 1 , 


which implies that 

We then have from (7.7) and (7.8) 


CC' = I. 


(7.8) 


z'z = fCC'y = fy = ^yl 


( 7 . 9 ) 
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Table 7.2 ANOVA for CRD with Orthogonal Contrasts 


Source 

d.f. 

SS 

五 (MS) 

Treatments 

t-1 

SS(T) 


Cl 

1 

SS(C!) 

^e+r(c[r) 2 

c 2 

1 

SS(C 2 ) 

^ + r(c' 2 r) 2 

Ct-i 

Error 

1 

t(r - 1 ) 

SS(C t _i) 

SS(E) 

4 + u) 2 


On the other hand, it follows from (7.7) that 

z，z = Y, z i 


1=0 


1=0 




t-1 


E [你 2 


=段 2 + 1]蝴 2 

i=i 

Equating the right-hand sides of (7.9) and (7.10) we obtain 

谙.-均 2 

1=1 i 

=H(n) 2 . 


(7.10) 


(7.11) 


Multiplying both sides of (7.11) by r and making use of (7.5) we thus obtain (7.6) and 
hence Table 7.2. 

This result can be incorporated into the ANOVA table by amending Table 6.1 as 
given in Table 7.2, This shows that in order to test 

Hq: = 0 

with r — (ri ， r 2 , .... we use, as an approximation to the randomization test, the 
F-test 

SS(C0 


F 




(7.12) 


MS ㈤ 

Alternatively, an approximate (1 - a) 100% confidence interval for cjr is given by 

，MS ⑻、 1/2 


cjy 


: /2’t(r-l) 


(7.13) 
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or, more generally, 

Cl ±tl- a /2,t{r-l) v3r (5/)] . 

We conclude this section by emphasizing that, although the structure of the treat¬ 
ments often suggest a complete set of orthogonal contrasts, one should not insist on 
considering only orthogonal contrasts. Other contrasts, as long as they are preplanned, 
may be more informative for explaining the results from an experiment. 

7.3 ORTHOGONALITY AND ORTHOGONAL 
COMPARISONS 

Having discussed the concept of orthogonal comparisons for the equal replication CRD 
it seems appropriate and necessary to give some formal description of the basic ideas 
of and connection between orthogonality and orthogonal comparisons. 

Orthogonality is a concept that is usually and easily applied to vectors. Let z / = 
{zi,Z 2 ) ... : z m ) represent an m-vector. Linear forms in z may be represented by c(z 
where = (ca, Q 2 ,.. ■ ， Cj m ). Two linear forms, c[z and c^z, are said to be orthogonal 
if c[c 2 = 0 , that is, if the vectors Ci and C 2 are orthogonal vectors. 

This notion of orthogonality is used when we refer to orthogonal treatment contrasts 
as discussed in the previous section, for example, two contrasts, c[r and d 2 r with 
= c f 2 0 = 0, are orthogonal if c[c 2 = 0. But these comparisons also bear on 
random variables, namely, when we consider the estimated comparisons. For this we 
need to consider comparisons of random variables. 

Let x’ = (xi,X2, • • •, x m ) be random variables with variance-covariance matrix V. 
Let c^x and c^x be linear functions of x, with = c f 2 0 = 0. The two comparisons, 
c^x and c^x, are said to be orthogonal if cov^x, c^x) = 0 , which is equivalent to 
c[\C 2 = 0. The term orthogonal as used with random variables, z\ and 22 say, means 
that cov(zi. Z2) = 0. It is unfortunate that the adjective orthogonal was taken over from 
the mathematics of inner product spaces to the lack of covariance of random variables 
(just as, indeed, the term independence was taken over and leads to confusion). 

Returning to estimated treatment comparisons, using observations from a CRD, 
we need to consider functions of random variables of the form c^y and c^y, where 
y = (yi ， 安 2 , •. • ，没 t)'. We have shown in Section 6.3 that the variance-covariance 
matrix for y, using (6.13) and (6.14), is given by 



for additivity in the strict sense, and 

V = 4 + 4 ) - 

for additivity in the broad sense. It follows then immediately for the equal-replication 
CRD that if two treatment contrasts, c[r and c f 2 r, are orthogonal then the estimated 
contrasts, c^y and c^y, are also orthogonal, that is, satisfy c[\C 2 = 0. 
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This result does, however, not hold for the unequal-replication CRD as considered 
in Section 6.7. The important change in V as given above is that (l/r)I has to be 
replaced by R -1 ， where 


/ n \ 

丄 0 

R- 1 = r2 

\o tJ 

It is then obvious that ci VC 2 # 0. 

The implications of these results are that for the equal-replication CRD the contrast 
SSs are orthogonal (see Table 7.2)，but for the unequal-replication CRD they are not, 
that is, (7.6) does not hold. 


7.4 COMPARISONS FOR QUANTITATIVE 
TREATMENTS 

As mentioned earlier, orthogonal contrasts are often suggested a priori by the treatment 
structure. This should not, however, be taken as a general rule. Other preplanned (and 
hence nonorthogonal) comparisons may be more suitable for answering the investi¬ 
gator^ questions. The only difference to the discussion above is that (7.6) does not 
hold, but tests of the form (7.12) and confidence interval estimation of the form (7.13) 
can still be carried out. The main point here is that the comparisons are preplanned 
in a meaningful way, that is，determined by the intent of the experiment and not by 
the outcome of the experiment, and that the number of such comparisons is generally 
small. 


7.4.1 Comparisons for Treatments with Equidistant 
Levels 

Another type of preplanned comparison arises if the treatments represent quantitative 
levels of some factor, for instance, increasing amounts of fertilizer. Rather than com¬ 
pare individual levels with each other, it is more informative to investigate whether 
there exist certain trends in response to the treatments, for example, whether the in¬ 
crease (decrease) is linear or whether there exists some curvature. 

Suppose X 1 .X 2 ,... represent the t levels such that X\ = 0,a：2 = d, = 
2 d,..., Xt = (t~l)d with d> 0, that is, the levels are equidistant, with x — {t—l)d/2. 
Without loss of generality we may take d = 1 and obtain 


Zi= Xi~~ 


2 


(7.14) 
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as the centered levels, with T，iZi = 0. We then consider the contrast 


c L = t 

i=l 



(7.15) 


which is of the form (7.1) with cu = Zi/y/T,zf,. The estimator for (7.15) is given by 

C L = y r lL^ y i . (7.16) 

1 多 4 

Using the fact that Hzi = 0 we recognize immediately that (7.16) is the estimate of the 
linear regression coefficient in the model 

负 .=a + 0 Wi + error 

with Wi = Zi/y/Lzf,, that is, 0 = Cl- Alternatively, for the models 

yi. = a + 8* Zi + error (7.17) 


or 


负 =a* + 9*Xi + error 


the estimator for /?* is 0* = Thus, in either case, Cl is a measure of the 

linear increase (decrease) due to the increasing treatments. 


7.4.2 Use of Orthogonal Polynomials 

Model (7.17) may not describe adequately the relationship between treatments and 
responses. We may, for example, want to allow for curvature in the response by con¬ 
sidering a model of the form 

负.二 a 丄 f3iZi + (3 2 zl + error. (7.18) 

This can be done by using the methods of regression analysis, that is, estimate a, 0\ , 02 
by the method of least squares (see Chapter 4). If the emphasis, however, is to find out 
whether curvature exists and of what kind it is, then it is more convenient to rewrite 
(7.18) in terms of so-called orthogonal polynomials as 

Vi. = loPo{zi) + ^iPi(zi) + 72^2(^) + error 


or, more generally, 

t-i 

Vi. = y^^iPi(zj) + error. (7.19) 
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Here the Pi(z{) are polynomials of degree 1(1 = 0,1..... ,t — 1) such that 

t 

Pi(zi) = 0 (Z = l ， 2 ， ... ， t-1) (7.20) 

Z = 1 

t 

， S ^Pi{zi)Pif{zi) = 0 (l l'). (7.21) 


The Pi {zi) are, apart from a constant factor, Tchebycheff polynomials used in statistics 
first by Fisher (1921). Specifically, for Z = 0, 1 ， 2, 3, 4 we have 

Po(zi) = 1 


尸 1 ( 之 i )= 入1 之 i 

巧 ㈤ = 入2 4 - - 丄） 

尸3(々） =入3 — 7)4 


P^{zi) = A 4 zf - ▲( 汾 2 - 13 )^ 2 + ▲( 亡 2 _ 丄沿 2 _ 9 )， 

where the 入 《 are chosen such that the Pi(zi) are positive or negative integers. These 
polynomials have been tabulated by, for example, Fisher and Yates (1957)，Pearson 
and Hartley (1970), and Beyer (1991) for t = 3,4,,.and l = 2,3,4,5, 6. For 
t = 3,4,5,6, the Pi(zi) are given in Table 7.3 (for Z = 1, 2, 3, 4 these values can be 
obtained by substituting the Zi of (7.4) into the above expressions). 

The estimator for the regression coefficients in (7.19) are obtained by the method 
of least squares as 




i _ 


(? = 0,1， 2,. • • ，艺一 1) 


(7,22) 


with 



(7.23) 


We can deduce easily, using (7.22) for / = 1 together with Pi(zi) = Ai^, that 



Cl 
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Since Xiy/'Ezf, is a constant it follows then, of course, that 71 , too, is a measure 
of the linear trend of the response variable. Just as 71 represents the linear trend or 
contrast, 72 represents the quadratic contrast, 73 the cubic contrast, and so on. One of 
the important properties of the formulation (7.19) in terms of orthogonal polynomials is 
that the estimates are independent of the number of polynomials included in (7.19), 
for instance, 71 is the same whether (7.19) is a linear model or a quadratic model or a 
cubic model, and so on. 


7.4.3 Contrast Sums of Squares and the ANOVA 

Since the orthogonal polynomials represent the coefficients of a complete set of orthog¬ 
onal contrasts, it follows further that, as described earlier, this can be used to partition 
SS(T) into t — 1 single d.f. sums of squares associated with 71 , 72 , ， 7 亡一 1 ， respec¬ 
tively. The sum of squares associated with 7 ^ SS( 7 !) say, is 

SS(7/) = r(7 ； ) 2 E^^)] 2 (7-24) 

i 

(Z = 1 ， 2,.. •, t — 1) using the rule for a single d.f. sum of squares. Then 

t-i 

SS(T) = ^SS( 7( ). (7.25) 

^=1 

To investigate the trend of the treatment effects, we would typically fit a low order 
model first, then check for lack of fit (LOF), and, if necessary, add further terms. Sup¬ 
pose we fit first the model 

Q 

Vi •二 ^^iPi(zi) + error ( 1 . 26 ) 

1=0 


with q < t — 1, then the ANOVA takes the form as given in Table 7.4, with 

Q 

SS(LOF) = SS(T) - SS(7i)- 


Then 


MS(LOF) 

MS(E) 




(7.27) 


can be used to test whether (7.26) provides an adequate fit to the data. If the F-value in 
(7.27) is significant at a given level, then there is lack of fit and additional terms should 
be added to the model repeating the procedure just described. 

The method of fitting orthogonal polynomials is quite easy to carry out. It depends, 
however, heavily on the fact that the levels of the treatment factor are equidistant, be¬ 
cause this is the reason why orthogonal polynomials can be tabulated. For nonequidis- 
tant levels the method can be used also, except that one has to compute the polynomials 
sequentially, using the properties (7.20) and (7.21) (fora computational method see, for 
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Table 74 ANOVA for CRD Fitting Orthogonal Polynomials 


Source 

d.f. 


ss 

Treatments 

t-1 

ss(.r) 


7i 

1 


SS( 7 i) 

72 

1 


SS( 72 ) 

IQ 

1 


SS( 7 ,) 

Lack of fit 

t ~ q — l 


Difference = SS(LOF) 

Error 


SS{E) 



example, Narula, 1978). This is rather cumbersome and worthwhile only if the same 
experiment is being used repeatedly. It is for this reason that equidistant levels (actual 
levels or transformed levels such as log dose) should be used wherever possible. 

In concluding this section we should emphasize that fitting polynomials may not 
always be appropriate. One may, for example, be interested in finding an asymptote to 
the response curve and then a model of the form 

yi, = a -h 3e~ 1Zi + error 

or some variation of it might be used. The important point is that the nature and aims 
of the experiment should provide guidance for the analysis (see Chapter 2). 


7.5 MULTIPLE COMPARISON 
PROCEDURES 

7,5.1 Multiple Comparisons and Error Rates 

It is rare that in a well thought out experiment the treatments do not have any struc¬ 
ture which would lead naturally to the types of comparisons discussed in Sections 7.2 
and 7.4 or Chapter 11. Occasionally, however, it may be desirable to make a large 
number of comparisons, for example, all possible pairwise comparisons, or compar¬ 
isons suggested by the data (data-snooping). Whether one talks in terms of hypothesis 
testing or interval estimation, great care must be taken to use correct inference proce¬ 
dures. The major problem here is the so-called multiplicity effect (Tukey, 1977) which 
may lead to too many significant tests if incorrect procedures are used. The problem 
revolves around the notions of comparisonwise error rate (CWE), familywise error 
rate (FWE), and per family error rate (PFE). Basically, the CWE is being used for 
situations discussed in Section 7.2, whereas the FWE or PFE are being controlled in 
the types of comparisons mentioned above. Both FWE and PFE take the number of 
comparisons to be made (these constitute the “family”)，iV say, into account. If we 
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have 亡 =10 treatments, for example, we have A 7 = (^ 0 ) = 45 pairwise comparisons. 
The idea then is to control the error rate for this family rather than for each individual 
comparison, where error here refers to the Type I error in the context of hypothesis 
testing. The relationship between CWE and FWE can be expressed as (Hochberg and 
Tamhane, 1987; Westfall et al., 1999) 

1 - FWE = (1 - CWE) iV 

or 

FWE = 1 - (1 - CWE)' (7.28) 

In order to achieve a certain FWE (say .10), we can use (7.28) to determine the CWE 
to be used for each individual test. As an approximation for small CWE and not too 
large N ， we have 

FWE x CWE. (7.29) 

A number of multiple comparison procedures (MCPs), sometimes also referred to as 
post-hoc comparisons, have been developed which control the FWE either in the weak 
sense (under Hq., 丁 1 = 丁 2 = … = 丁 t only) or in the strong sense (that is, under all 
configurations of the r^s) (Hochberg and Tamhane, 1987). Most of these procedures 
have been available for some time and are widely used, but their acceptance is not 
universal. We, too, caution against their uncritical use but at the same time we feel that 
MCPs can play a useful role in data analysis. In what follows we describe briefly a few 
MCPs, but defer details to books on this subject such as Miller (1981), Hochberg and 
Tamhane (1987), Hsu (1996), Westfall et al. (1999). ‘ 

Although all MCPs test the same null hypothesis they use different methods for 
controlling the FWE. They also differ in their sensitivity to alternative hypotheses. 
For these reasons different MCPs applied to the same data may give different results. 
Studies have been undertaken to compare the different MCPs (see Carmer and Swan¬ 
son, 1973; Miller (1981); Stoline, 1981) but sometimes seemingly contradictory results 
make it difficult to give general recommendations which MCP to use in a given situa¬ 
tion. 

7.5.2 Least Significant Difference Test 

This test was first proposed by Fisher (1935) and is now often referred to as Fisher’s 
protected LSD test. It has two stages. At stage I we test Ho m . ri = T 2 = = r t = 0 

by the F-test of size a. If the F-test is nonsignificant, we terminate the analysis. If the 
F-test is significant, then at stage II we test each single comparison Hq ： Ti = by an 
a-level t-tesi with t(r — 1) d.f. 

7.5.3 Bonferroni 艺 -Statistics 

This test is based on the Bonferroni inequality 


1 - FWE > 1 - A T x CWE. 
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Making use of (7.29), it uses a*-level t-tests with t{r — 1) d.f. for the individual tests 
Hq ： — Ti> where 

a* = a/N 

and a is the chosen FWE. Tables for the upper percentage point of the Bonferroni 
亡 -statistic, t a / 2 A\t(r-i) are given by Miller (1981). 


7.5.4 Studentized Range Procedure 

This procedure was proposed by Tukey (1952, unpublished). To test the hypotheses 
Ho ： Ti = Ti'(i # i') we compare | 奶 .一 办 ,| with Q a ^tu(r-i) v / MS(£ ； )/r, where 
Q a ,t,t{r-i) is the upper a 100% point of the studentized range distribution for t inde¬ 
pendent, unit normal, random variables divided by the square root of an independent 
X^/ v random variable with v = t{r — 1) d.f. If, for a given a, 

\Vi. - Vi'.\ > (7.30a) 

then Ti and are considered to be different from each other. Tables for Q a ,t,t(r-i) 
are given by e.g., Harter (I960)，Hochberg and Tamhane (1987). 

The MCP described above has been extended by Kramer (1956) to accommodate 
unequal numbers of replications. This method is referred to as the Tukey-Kramer 
method. It simply replaces (7.30a) by 


\yi. ~ Vi'.\ ^ Qa,t,iyj ^ + MS(_B )， (7.30b) 

where v = — t. 

The test procedures given by (7.30a) and (7.30b) control the FWE at a given a. It 
is then easy to see that simultaneous (1 — a) 100% confidence intervals for all compar¬ 
isons T{ — Ti' can be obtained as 


Vi. 一 Vi’ ， 士 Qa, t.u 



( 易 4) MS ㈣ 


7.5.5 Duncan’s Multiple Range Test 

This test was developed by Duncan (1951 ， 1955) to specifically test all hypotheses 
Ho - Tf = T{> by considering different error rates depending on the range of the two 
corresponding treatment means y it and 歹 i'.. If, among the ordered treatment means, 
Vi. and yi\ are p means apart, then an -level studentized range test is conducted 
comparing \y L - \jy/MS{E)/r with the critical value where 

a p = 1 - (l - a) p ~ x . 

We start by arranging the t treatment means in ascending order, say 

y[i],y[2}^ - ： V[t] 
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and compare 卽 ] — 5 ⑴ with Q at ,t,t(r-i) y/MS(E)/r. If this difference is nonsignif¬ 
icant then all other differences are also nonsignificant; if the difference is significant, 
however, then (t - 1)-differences y[t] — V[2) and y[t-i\ — V{\] are considered and com¬ 
pared with Q at _ 1( £-i.t(r-i)\/MS(S)/r. This procedure is continued, reducing the 
critical value at each step, until no more differences can be declared significant. Tables 
for the critical values are given by Harter (1960) (see also Miller, 1981). 


7 . 5.6 Scheffers Procedure 


One of the most general MCPs is that proposed by Scheffe (1953) forjudging all possi¬ 
ble contrasts in the analysis of variance setting. It is especially useful for data-snooping 
after the F-test for Hq : ri = t<i = ••- = r t has been found significant, because the 
FWE for all possible contrasts is equal to a, the size of the F-test. 

To test the hypothesis 

t 

Ho' ^ CiTi = 0 

i—l 

for all (ci, C 2 , •.. ， c t ) such that Eci = 0, we compare 





with the critical value 

[(f - 1)6-0 ，卜 ; ^(r-i)] 1 ’ 2 . 

Alternatively, this procedure can be used to construct simultaneous (1 — a) 100% con¬ 
fidence intervals for all contrasts in the form 


Ciyi, ± [(i - l)-Fl-a^-l,t(r-l )] 1/2 x 


. i 


US{E) 

r 


1/2 


7.5.7 Comparisons with a Control 

In some experiments the t treatments may consist of a control treatment and t — l what 
may be referred to as test treatments, and the aim of the experiment is to compare the 
test treatments against the control (see also Section 9.8.2 and II.6.5). If treatment 1 is 
the control, then testing the hypotheses 

Ho ： Ti= Ti (i = 2,3,...,t) 

with FWE = q can be achieved by a procedure due to Dunnett (1955, 1964). Rather 
than compare the usual statistic 

(7.31) 

against the critical value of the t-distribution, we compare (7.31) with the critical value 
for two-sided tests and D a ^i^ r -i),p for one-sided tests. Tables 
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of these critical values are given by Hochberg and Tamhane (1987, Tables 5 and 4, 
respectively) using p = .5 (here p is the correlation between yi.— 奶 . and yi. 一 D. 

So far we have discussed the CRD with the same number of replications for all 
treatments. The situation described above, that is, one control, t — 1 test treatments, is 
one where it may be advisable to have unequal numbers of replications for the treat¬ 
ments. Specifically, we may use r replications for the test treatments and r c (> r) 
replications for the control in order to estimate " + n as precisely as possible. For this 
case, the test procedure remains the same, except that the critical values change for two 
reasons: (i) The d.f. for MS(jE') are now u — r c — 1 (t — l)(r — 1), and (ii) the 
correlation coefficient between yi, - yi, and y\, — is now given by 

p* = - < .5. 

r c + r 

Critical values |D| a ，卜 1 火 ，〆 and are given by Hochberg and Tamhane 

(1987) for p* = .1 and .3. Dunnett (1964) also provides a method for approximat¬ 
ing the critical values for values of p* other than those given above and for values pa' 
where treatment i is replicated Vi times and not all equal. 

It should be clear, of course, that Dunnett’s procedure cannot only be used for 
testing but also for obtaining (1 — a) 100% simultaneous confidence intervals for r\ — 
n(i = 2,3 ，...，作 

7.5.8 Alternatives to Tests Based on Normality 

Throughout our discussion in Chapter 6 of the analysis of data from a CRD, we have 
not made any assumptions about the underlying distribution of the data. We have relied 
on the approximation of the F-test to the randomization test to carry out tests in the 
AN OVA and we have similarly used indications of a computational nature to use the t- 
test as an approximation to the corresponding randomization test for follow-up studies 
(such as discussed in Section 7.2). The implication, of course, is that the assumption of 
normality is not generally of crucial importance for such tests (see also Scheffe, 1959). 

The procedures discussed in this section all depend on the assumption of normality, 
and there are strong indications that they are not very robust against deviations from 
normality (Ringland ， 1983). This is especially true for the Bonferroni procedure and 
less so for the Scheffe procedure. One alternative in such situations is to use non- 
parametric procedures. We shall not discuss this here any further, but nonparametric 
analogs to some of the procedures described here are presented and discussed by Miller 
(1981) and Hochberg and Tamhane (1987). One of the problems with MCPs is that in 
many cases they lead to results that are not easily interpreted (see also Section 7.4) and 
this problem may become even worse using nonparametric MCPs. In both cases the 
choice of the error rate a will be important, say a = .10 or even a = .20, certainly 
larger than the conventional a = .05. 

Another alternative to nonnormal situations is to use robust estimators for the treat¬ 
ment effects, such as Af-estimators (Huber, 1981). This, however, leads to great diffi¬ 
culties in that the distributions of the test statistics will be difficult, if not impossible, 
to obtain and one would have to rely on Monte Carlo simulations or asymptotic results. 
These prospects together with the general difficulties of MCPs are not very promising. 
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7.6 GROUPING TREATMENTS 

One of the objectives of MCPs, apart from comparing every treatment with every other, 
is to arrive at groups of “homogeneous” treatments. This will facilitate the interpre¬ 
tation of the results from the experiment and help in making recommendations con¬ 
cerning further action. Unfortunately, the picture is not always clear. Often we find 
overlapping of groups of treatments which are judged (according to some MCP) to be 
not significantly different from each other. 

To illustrate this phenomenon, consider the following example from Snedecor (1946). 
During cooking，doughnuts absorb fats in various amounts. An experiment was done to 
investigate whether the amount of fat absorbed is different for different fats used. Eight 
fats (treatments) were compared, each with six mixes (replications). The treatment (fat) 
means, that is, the average amount of fat (in grams) absorbed by 24 doughnuts, are as 
follows: 


Fat# 1 2 3 4 5 6 7 8 

6pt] height y L 172 178 182 185 165 176 161 162 

If we order the treatment means and perform the studentized range test [see Sec¬ 
tion 7.5.4] with a = .10, we obtain the following result (with MS(E) = 141.6 and 
Q. 10 , 8.40 — 4.099, the critical value is 19.91). 


Fat# 

7 

8 5 

1 

6 2 

3 4 

Vi. 

161 

162 165 

172 

176 178 

182 185 








Here, underlined treatment means are not significantly different from each other. 
Based on these tests we can say, for example, that fats # 7,8 are different from fats 
# 3,4 with 7 and 8, and 3 and 4 not being different from each other. This does not 
mean, however, that (7,8) and (3,4) form two distinct groups, because 7 and 8 are not 
different from 5, 1,6, 2, and 3 and 4 are not different from 1 ， 6, 2; in fact 3 is not 
different from 5 either. In summary, if one wants to establish groups of similar fats 
(for dietary purposes, for example) then it is clear that using the multiple range test 
(or other MCPs) will not accomplish that. What is needed is a procedure that uses，in 
combination with hypothesis testing, ideas of cluster analysis. 

Such methods were developed by Scott and Knott (1974) and by Calinski and 
Corsten (1985). We shall describe here one of the methods proposed by Calinski and 
Corsten (1985) which is based on an extension of the studentized range procedure (see 
Section 7.5.4). 

This procedure is a stepwise procedure and is referred to as a hierarchical, agglom- 
erative procedure which uses ordinary distance as a working criterion. The adjective 
“hierarchical” means that once a treatment mean is included in a homogeneous cluster 
it will not be deleted in a subsequent step, and the adjective “agglomerative” means 
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that at each step two adjacent clusters (each or both consisting possibly of one element 
only) are combined to form a new cluster. 

The algorithm starts with t clusters, namely the t treatments, represented by the 
treatment means, = 1 ， 2, … ， 亡 )， arranged in increasing order. At the first step 
the two closest treatments, as measured by the smallest \yi, — 仿 ，.|， are combined to 
form one cluster and the range R\ = \yi, — yi'\ is compared with the critical value 
C a = Qa,t ! 4( r -i)v^MS(£ , )7r for a given a. At each following step a new cluster is 
formed by combining two adjacent clusters with the smallest range. The range R s at 
step 5(1 彡 s 彡 t -1) is then compared with the critical value C a . If R s > C a then the 
process stops and the clustering obtained at step s-1 will be the accepted grouping of 
the treatments. The groups thus formed are considered to be internally homogeneous 
with the studentized range test at significance level a. 

We illustrate this procedure in Figure 7.1 for the example given above. Choosing 
q = .10 , the critical value is 

C io — 4.099 - ^ - = 19.91 

and from Figure 7.1 it can be seen that the process stops at step 7 since i ?7 = 24 > 
C.io = 19.91. Hence the grouping arrived at prior to step 7 will be accepted, that is, 
fats 5, 7, 8 form one group and fats 1 ， 2, 3, 4, 6 form another group. 

By using a fixed q, the probability of terminating too early, and hence accepting 
too many homogeneous groups is bounded by a. For small a this may lead，in fact, 
to too few groups. Here, as with MCPs in general, the choice of a is important and 
an a of .10 or .20 may not be unreasonable. Rather than choosing an a, Calinski and 
Corsten (1985) mention the possibility of computing at step s the probability 

MS(E)1' 1/2 ' 

， 

r 」 

that is, the smallest significance level at which the observed maximum range R s would 
lead to the rejection of the null hypothesis associated with the partition at step s. These 
probabilities can be obtained by using the computer program given by Dunlap, Powell, 
and Konnerth (1977). 

7.7 EXAMPLES USING SAS® 

Example 7.1: Consider the experiment described in Section 7.2 with 5 treatments 
and r = 2 replications in a CRD. The data are given in Table 7.5a. 

We use SAS PROC GLM to evaluate the following orthogonal comparisons from 
Section 7.2.1: (i) ， (ii), (iii) ， (iv), using contrast and estimate statements (see Table 
7.5a). The contrast statements are used to obtain the contrast SS. The estimate state¬ 
ments provide estimates of the contrasts and their standard error. We also perform 
Tukey’s multiple comparison procedure, providing tests for simple treatment compar¬ 
isons and simultaneous confidence intervals. We note that generally we would not 
consider orthogonal contrasts and multiple comparisons for the same experiment. 


= Pr Q t .t(r-i) > 
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6 i?6 = 13 


7 = 24 


Figure 7.1 Grouping of Snedecor data using the Calinski-Corsten procedure. 


One word of caution for the input of the contrast and estimate statements: Since we 
have used alpha-numeric labeling for the treatments, that is, C, Al, A2, Bl, B2, SAS 
writes them in alphabetical order as Al, A2, Bl ， B2, C. This requires us to enter the 
contrast coefficients in this order. 

We now turn to the output in Table 7.5b and make the following comments: 

(i) The basic ANOVA provides — .514. 

(ii) There are significant differences among the treatments (P = 0.0012). 

(iii) Writing the treatment LS means in increasing order and using a = .05 the results 
from the Tukey multiple comparison procedure can be summarized as follows, 
where treatments not connected by the same line are significantly different from 

Treatment: A2 C B2 A1 B1 

each other: LS rnean: 13.15 13.35 15.90 18.20 19.10 

Tukey (a = .05): - 二 _ 

The table in the SAS output provides exact P-values for the comparisons. 
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(iv) The simultaneous 95 % confidence intervals for the pairwise comparisons con¬ 
firm the results given in (iii), but provide additional information about the differ¬ 
ences. 

(v) The sum of the contrast SS equals, of course, the SS(Treatments). 

(vi) All specified contrasts are significantly different from zero. □ 

Example 7.2: Kuehl (1994) describes an experiment studying the relationship be¬ 
tween grain production and plant density. Using a CRD t = 5 plant densities (10, 20, 
30, 40, 50) were used, each density was replicated r = 3 times. The data are given in 
Table 7.6a. 

Since we have quantitative treatments we use the method of orthogonal polynomi¬ 
als (Section 7.4.2) to obtain the functional relationship (response curve) between yield 
and plant density. The input statements for SAS PROC GLM are given in Table 7.6b. 

We make a few comments: 

(i) Among the input statements we have included “contrast” and “estimate” state¬ 

ments. With the estimate statements we have given the divisor ^2i[Pi( z i)} 2 °f 
(7.22). - 一 — 

(ii) The contrast coefficients are obtained from Table 7.3 for t = 5 ‘ 

(iii) The output shows that the linear and quadratic coefficients are significant (P. .0001), 

(iv) The relationship between yield and density can therefore be expressed as 

V-i = lo Po{Zi) + 7i Pl{Zi) + 72 P2{Zi) 

with 

% = 16.40 = mean, 71 = 1.19, 72 = —1.01. □ 


Table 7.5 CRD with Orthogonal Contrasts and Multiple 
Comparisons 


a) Input statements: 

data pest; 
input trt S yield 
datalines; 

C 12.8 C 13.9 
A1 18.5 A1 17.9 
A2 12.3 A2 14.0 
B1 19.5 B1 18.7 
B2 16.0 B2 15.8 
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Table 7.5 (Continued) 

proc glm data=pest; 
class trt; 
model yield=trt; 

lsmeans trt/pdiff adjust=tukey cl; 
contrast 'C vs trt 1 1 1 1 -4; 
estimate，C vs trt，trt 1 II 1 -4/divisor=4; 
contrast ’A vs B’ trt 1 1-1-10; 
estimate ’A vs B’ trt 1 1-1-1 0/divisor=2; 
contrast *A1 vs AT trt 1 -1 0 0 0; 
estimate ’A1 vs A2’ trt 1 -1 0 0 0; 
contrast ’B1 vs B2’ trt 0 0 1 -1 0; 
estimate ’B1 vs B2’ trt 0 0 1 -1 0; 

title 1 ’COMPLETELY RANDOMIZED DESIGN (t=5, r=2 )，； 

titJe2 5 ORTHOGONAL CONTRASTS AND MULTIPLE COMPARISONS ’； 

run; 

b) Output: 


COMPLETELY RANDOMIZED DESIGN (t=5, r=2) 


ORTHOGONAL CONTRASTS AND MULTIPLE COMPARISONS 


The GLM Procedure 
Class Level Information 
Class Levels Values 


trt 


A1 A2 B1 B2 C 


Number cf Observations Read 
Number of Observations Used 


10 

10 


ORTHOGONAL CONTRASTS AND MULTIPLE COMPARISONS 
The GLM Procedure 


Dependent Variable : yield 


Source 


DF 


Sum of 

Squares Mean Square F Value Pr > F 


Model 


59.17400000 14.79350000 


28.78 0.0012 


Error 


2.57000000 0.51400000 


Corrected Total 


9 61.74400000 


R-Square 
0 . 958377 


Coeff Var 
4.497729 


Root MSS 
0.716938 


yield Mean 
15.94000 


Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

trt 

4 

59.17400000 

14 • 79350CC0 

28.78 

0.0012 

Source 

DF 

Type III SS 

Mean Square 

F Value 

?r > F 

trt 

4 

59.17400000 

14.79350000 

28.78 

0.0012 


Least Squares Means 

Adjustment for Multiple Comparisons : Tukey 

LSMEAN 


trt 

yield. LSMEAN 

Number 

A1 

18.2000000 

1 

A2 

13,1500000 

2 



234 


CHAPTER 7. COMPARISONS OF TREATMENTS 


C vs trt 

3 

A vs B 

-1 

Al vs A2 

5 

B1 vs B2 

3 


23750000 0.56678920 
82500000 0.5C695167 
05000000 0.71693793 
20000000 0.71693793 


C vs trz 1 16.77025000 16.77025000 32.63 
A vs B 1 6.66125000 6.66125000 12.96 
A1 vs A2 1 25.50250000 25.50250000 49.62 
B1 vs B2 1 10.24000000 10.24000000 19.92 


Parameter 


Standard 

Error 


Table 7.5 {Continued) 

B1 19.1000000 3 

B2 15.9000000 4 

C 13.3500000 5 

Least Squares Means for effect trt 
Pr > !tI for HO: LSMean(i)=LSMean(j) 

Dependent Variable : yield 

i/j 12 3 4 5 

0.0047 0.7260 0.1095 0.0057 

0.0022 0.0590 0.9982 

0.0022 0.0332 0.0026 

0.0590 0.0332 0.0773 

0.9982 0.0026 0.0773 


trt 


yield LSMEAN 

95% Confidence Limits 

Al 


18.200000 

16.896839 

19.503161 

A2 


13.150000 

11.846839 

14.453161 

B1 


19.100000 

17.796839 

20.403161 

B2 


15.900000 

14.596839 

17.203161 

C 


13.350000 

12.046839 

14.653161 

Least 

Squares Means for 

Effect trt 




Difference 

Simultaneous 95% 



Between 

Confidence 

Limits for 

i 

j 

Means 

LSMean{i)- 

■LSMean {j) 

1 

2 

5.050000 

2.174000 

7.926000 

I 

3 

-0.900000 

-3.776000 

1.976000 

1 

4 

2.300000 

-0.576000 

5.176000 

1 

5 

4.850000 

1.974000 

7.726000 

2 

3 

-5.950000 

-8.826000 

-3.074000 

2 

4 

-2.750000 

-5.626000 

0.126000 

2 

5 

-0.200000 

-3.076000 

2.676000 

3 

4 

3.200000 

0.324000 

6.076000 

3 

5 

5.750000 

2.874000 

8.626000 

4 

5 

2.550000 

-0.326000 

5.426000 


Dependent Variable : yield 

Contrast DF Contrast SS Mean Square F Value Pr > F 


1 

2 0.0047 

3 0.7260 

4 0.1095 

5 0.0057 



3 5 s 6 
2 5 0 6 

o 1 o o 
o o o o 

o o o o 
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Table 7.6 CRD with Quantitative Factors: Orthogonal Polynomials 


a) Input statements: 

data density; 

input density yield @@; 

datalines; 

10 12.2 10 11.5 1012.3 
20 16.1 20 15.3 20 16.6 
30 18.6 30 20.1 30 18.4 
40 17.7 40 19.3 40 17.0 
50 17.8 50 16.4 50 16.7 

run; 

proc glm data=density; 
model yield = density; 
class density; 

contrast ’linear’ density -2-10 1 2; 

estimate ’linear’ density -2-10 1 2/divisor=10; 

contrast ’quadratic’ density 2-1-2-12; 

estimate ’quadratic’ density 2-1 -2-1 2/divisor=14; 

contrast * cubic 5 density -120-2 1; 

estimate ’cubic’ density -1 2 0-2 l/divisor=10; 

contrast ’quartic’ density 1-4 6-4 1; 

estimate ’quartic’ density 1-4 6-4 l/divisor= 70; 

title 1 ’CRD WITH QUANTITATIVE FACTORS ’； 

title2 'CONTRASTS USING ORTHOGONAL POLYNOMIALS'; 

run; 

b) Output: 

CRD WITH QUANTITATIVE FACTORS 
CONTRASTS USING ORTHOGONAL POLYNOMIALS 

The GLM Procedure 
Class Level Information 
Class Levels Values 

density 5 10 20 30 40 50 

Number of Observations Read 15 

Number of Observations Used 15 

R-Square Coeff Var Root MSE yield Mean 

0.927949 5.040486 0.826640 16.40000 


Source 

DF 

Type I SS 

Mean Square 

F 

Value 

Pr > F 

density 

4 

88.00666667 

22.00166667 


32.20 

<.0001 

Source 

DF 

Type III SS 

Mean Square 

F 

Value 

Pr > F 

density 

4 

88.00666667 

22.00166667 


32.20 

<.0001 

Contrast 

DF 

Contrast SS 

Mean Square 

F 

Value 

Pr > F 

linear 

1 

42.72133333 

42.72133333 


62.52 

<.0001 

quadratic 

1 

42.80380952 

42.80380952 


62.64 

<.0001 

cubic 

1 

0.28033333 

0.28033333 


0.41 

0.5362 

quartic 

1 

2.20119048 

2.20119048 


3.22 

0.1029 




Standard 



Parameter 

Estimate 

Error 

t Value 

Pr > 1ti 

linear 

1.19333333 

0.15092309 

7.91 

<,0001 

quadratic 

-1.00952381 

0.12755329 

-7.91 

<.0001 

cubic 

0.09666667 

0.15092309 

0.64 

0.5362 

quartic 

0.10238095 

0.05704356 

1.79 

0.102 
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7.8 EXERCISES 

7.1 Consider an experiment to investigate the effects of sugar on the length of pea 
sections grown in tissue culture. A CRD is used with 5 replications for each of 
the treatments: 

Ti： Control (nothing added) 

T 2 ： 2% glucose added 
r 3 : 3% glucose added 
T 4 ： 2% fructose added 
T 5 ： l%glucose + l%fructose added 

(i) Obtain a complete set of meaningful orthogonal contrasts and explain what 
each contrast means. 

(ii) Suppose we obtain the following results: 


Treatment 

1 

2 

3 

4 

5 

Mean 

70.1 

59.3 

58.2 

58.0 

64.1 


and the following partial AN OVA table 
Source SS 

Treatments 538.66 
Error 245.50 

Total 784.16 

Partition the SS(T) into single d.f. sums of squares for the orthogonal 
contrasts obtained in (i) and test the hypotheses that each contrast is equal 
to zero. 

7.2 Consider a CRD with 5 treatments，6 replications for each treatment and 4 obser¬ 
vations for every experimental unit. Suppose the treatments represent increasing 
amounts (a ： i) of fertilizer applied to a certain crop. 

The following (partial) results are obtained: 


Xi 

0 

2 

4 

6 

8 

Vi.. 1 

4,9 

10.0 

13.9 

15.7 

16.3 


SS(EE) = 50.0, SS(OE) = 60.0 
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Using the method of orthogonal polynomials investigate whether 

(i) the data exhibit linear and quadratic trends; 

(ii) first and second order terms provide an adequate fit to the data. 

7.3 Consider the data in Example 7.1. Obtain a grouping of the treatments 
using the method described in Section 7.6. 

7.4 Consider the data in Example 7.1. Perform Dunnett’s procedure (Section 
7.5.7) comparing ,4i, A 2 , Bi, B 2 with C. 

7.5 Using the results from Example 7.2 obtain the prediction equation 

V = 80 0 i density + 02 ( density ) 2 

using (i) the form of the Pi(z-i) given in Section 7.4.2 and (ii) fitting the second 
degree polynomial directly. 
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CHAPTER 8 

Use of Supplementary 
Information 

8.1 INTRODUCTION 

As we have pointed out earlier, one of the main purposes of experimental design is the 
reduction of error. One important component of the overall error is the unit error, ex¬ 
emplified by (see Section 6.3), expressing a certain amount of heterogeneity among 
the EUs. Generally speaking, such variability among the EUs may be systematic or 
random. Consider the following examples for systematic variation: 

(i) a fertility trend exists in a piece of land used for an agronomic trial; 

(ii) animals used in a pharmaceutical experiment may come from different litters; 

(iii) the experimental material for an industrial experiment may come from different 
production processes; 

and for random variation: 

(iy) plants for a growth trial may be of different heights at the beginning of the trial; 

(v) animals for a dietary study may have different initial weights; 

(vi) individuals for an educational study may have different abilities as documented 
by I.Q. or earlier test scores. 

In the case of systematic variation, knowledge of the underlying reasons will lead 
to blocking and hence to more complex designs which will be discussed in subsequent 
chapters. For random variation the additional (supplementary) information can, under 
certain conditions，be used effectively to reduce the error in a CRD. This procedure, 
introduced by Fisher (1932) and referred to as analysis of covariance, is the topic of 
this chapter. For a general description see also Cochran (1957). 
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8.2 MOTIVATION OF THE PROCEDURE 

In its simplest form we have two measurements for each EU (assuming that the EU and 
OU are identical): 

y : the response to the treatment 
x : the supplementary information. 

Examples for the cases (iv)-(vi) above might be 

(iv) y denotes the growth of the plant after exposure to a treatment, x denotes the 
initial height; 

(y) y denotes the final weight after the treatment, x denotes the initial weight; 

(vi) y denotes the test score after the treatment, x denotes a test score before the 
treatment. 

It is assumed that the supplementary information, or covariate, x is independent 
of the treatment. This is a rather crucial assumption with respect to the unbiased es¬ 
timation of differences among treatment effects (Rosenbaum, 1984). This means that 
covariates must be either obtained before the treatment assignment and/or application, 
or they must be known not to be influenced by the treatment, for example the outside 
temperature during a physical exercise in a clinical study. Also, it is known or suspected 
that there exists a functional relationship between the response y and the covariate x. 
We emphasize that this relationship may not be a causal relationship. The co van ate x 
may be correlated with something, often unknown, which causes extraneous variation 
in the response y (Smith, 1957). To illustrate this point Smith (1957) describes an ex¬ 
ample where in a field experiment x represents the amount of weed present in a field 
plot. The weed itself may not have affected the crop yield, rather it may have been a 
surrogate for the soil acidity present in the plot, which is correlated with the growth of 
weed as well as the crop under investigation. For purposes of our present discussion 
we shall assume that the relationship between x and y is linear. In its simplest form, in 
the absence of treatment effects, the data may look as given in Figure 8.1. 

It is informative, for purposes of illustration and motivation, to consider the model 
(6.3) 

Tik = Ti + Uk 

and write it in the form 

T ik = ^ + a + 0{X k -X.) + Ul (8.1) 

that is, the unit contribution Uk is modeled as 


U k =a + 3(X k - X.) + U* k 
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Figure 8.1 Relationship between covariate and response. 


where a, f3 are constants and Xk is the value of a covariate for EU k. Model (6.6) can 
then be rewritten as 


Vij = " + n + a + /? [ 5^ (Xk - X.) + ^2 ^ij u l 

k k 

="* + B + 8{Xij - X..) -h 略 

In the absence of treatment effects, (8.2) reduces to 


( 8 . 2 ) 


Vij = "* + (3{xij - x..) (8.3) 

or 

Z ij = Vij ~ 0{pCij — 2 ..、 = 〆 + (8.4) 

The form of (8.4) suggests that if we adjust the observations by the concomitant 
information /3(xij - x..), then the new “observations” are constant apart from noise 
, where 

Er^j) = 0 


and 


var 



and, most importantly, 

< o- 2 u . 

In fact, measures the variability of the EUs around the regression line. It seems then 


natural to obtain the adjusted observations (8.4) and perform the usual analysis on them. 
The problem, of course, is that usually we either do not know 3 or we think we know 
8 but it is not the correct value (in many cases p = l is used such as 2 二 post-test- 
pretest scores, or z = final weight-initial weight). The question then arises: How do 
we use the supplementary information and how do we make adjustments? 
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8.3 ANALYSIS OF COVARIANCE 
PROCEDURE 

8.3.1 Basic Model 

Generalizing model (8.3), we now consider the model 

Vij = " + n + P{xij -x..) +e^- (8.5) 

for the observations from a CRD where supplementary information of the form de¬ 
scribed above is available {i = 1. 2,..., j = 1,2,..., r). The are considered to 
be i.i.d. random variables with mean zero and variance a 2 e * (to relate e 心 to our earlier 
discussion we can think of e*j being of the form 

e ij = u ij + + ^3 = £ ij + Vij (8.6) 

with (0, of*) and + A graph of the data (yij , Xij ) may then be as 

illustrated in Figure 8.2 for t = 3, r = 6 , where the lines labeled T\, T 2 , T 3 represent 
the linear relationship between y and x for treatments 1，2, 3, respectively, that is, (8.5) 
for z = 1, 2, 3, 

Our aim is to estimate contrasts among the treatment effects, that is, and test 
hypotheses about treatment effects, such as Hq ： t\ 二 T 2 二 ...== 0. Assuming 
again for a moment that we know 0, it follows easily from (8.5) and Figure 8.2 that, 
for example, 

ti - f 2 = yi. - y 2 . - 0[{xi. - x..) - {x 2 . - x.J\ 

= Va x - Va 2 , ( 8 . 7 ) 

where is the y-value at x == x,. for treatment i. The estimator (8.7) is, of course, 
the corresponding difference between the treatment means plus an adjustment due to 
differences in the covariates for the two treatments. For that reason —Va 2 referred 
to as the adjusted treatment difference. 


8.3.2 Least Squares Analysis 

Let us now turn to the more important case with /3 unknown. We shall use the method 
of least squares (for the arithmetic of analysis of covariance see also Section 4.13) to 
obtain estimates of estimable functions involving /i, t “ and /?. Using model (8.5) we 
obtain the normal equations (NE) by minimizing the expression 

Q = ^[Vij - /3{xij - x..)) 2 ( 8 . 8 ) 

hj 
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Figure 8.2 Graph of data (y, x) from CRD. 

with respect to ", Ti(i = 1,2,..., t), (3. Differentiating (8.8) w.r.t. these parameters 
and equating the derivatives to zero leads to the NE 

trfi + rY^ n + 3 一无 ••）= ?/.. (8.9) 

i ij 

rfi + rfi P^2{xij - x..) = yi, (8.10) 

3 

(? = 1 , 2 ,...,^) 

y^(^- - - x..)ri py^^xjj -无 “) 2 = - 无 ..)， 

ij ij ij ij 

( 8 . 11 ) 

where With Eij (xij — x..) — 0 and putting T，ifi = 0 (since 

'Eri = 0) equations (8.9) - (8.11) can be simplified. From (8.9) we obtain 

A = y- (8.12) 

and from (8.10) and (8.12) we obtain 

n = Vi. -y.. - P(xi. - x,.). (8.13) 

Substituting (8.12) and (8.13) into (8.11) yields 

[(〜 -x..)[{yi. - y..)- 0{xi. -X..)} +/3^(x i>7 - - x..) 2 = ~ x..) 

ij ij ij 
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Table 8.1 Auxiliary ANOVA for CRD 


Source 

SS(y) 

SS ⑻ 

SP(:r ， y) 

Treatments 

Tyy 

T xx 

Txy 

Error 

p 

hyy 

Exx 

Exy 

Total 

Q 

°yy 

^xx 

Sxy 


or 

r - x..)(yi. - y..) + 3 -x..) 2 - r ^{xt. - x..) 2 

i ij i 

= ^2( x ij 一元 H ( 8 . 14 ) 
ij 


It is useful to introduce some simplifying notation. In Table 8.1 we give symbols for 
the various sums of squares and sums of products for a CRD, using the yij and Xij as 
“observations.” In Table 8.1, the SS(y) are the same as those in Table 6.1, the SS(x) 
are obtained analogously with the substituted for the and the SP (: r ， y) are sums 
of products rather than sums of squares, for example, 

T xy = r^{xi. -x..)(yi. - y..). 


Using this notation and using the algebraic fact that T pq + E pq = S pq , where p, q are 
replaced by x and/or y, we rewrite (8.14) as 


T X y + P^Sxx — ^xx ) = Sxy 


or 


g = Exy 
’ Exx 


(8.15) 


8.3.3 Least Squares Means 

Under our model assumptions it follows that /}, and ,8 given by (8.12), (8.13), and 
(8.15), respectively, are the BLUEs of ", and respectively. Hence 

A + ^ = yi. - - (8.16) 

is the BLUE for " + %， the response for treatment i. The right-hand side of (8,16) 
is often referred to as the adjusted treatment mean, adjusted for differences in the co¬ 
variates. It is also called the least squares mean for treatment z, which we shall write, 
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for future references, as LSM(rj) [Searle, Speed, and Milliken (1980) refer to // + 
as the population marginal mean (PMM) and to + f ? as the estimated PMM; Lane 
and Nelder (1982) refer to p. + f{ as the predictive margin]. It follows further that the 
BLUE for a treatment contrast HciTi with E^Ci = 0 is given by 

= ^2 Ci ^- - ci ^ L ( 8 . 17 ) 


As a special case we have 

丁 i _ 丁 i’ = — 千 i/ = Vi. — Hi’. _ _ 

which is, of course, (8.7) with /? replaced by t d. 

To obtain the variances of the estimators given above we use the fact that and 3 
are uncorrelated. We then find 


var(/3) 


E xx 


var(/i -h Ti) 
COv(/i 千 = 

var I > ' ^ = 




1 + (Xj. - X,) 2 

r E xx 

{Xj, -x.) 

Exx 

E c " (!>< 


Exx 




(8.18) 



(8.19) 

.<4* 

.On 

(i") 

(8.20) 

.) 

^e* 

(8.21) 


with Eci = 0. 


8.3.4 Formulation in Matrix Notation 

In order to estimate cr^ and to test Hq: 丁 1 = 丁 2 二 … =Tt and /3 = 0 we turn 
to the analysis of variance. For the derivation of the ANOVA table we make use of 
results in Sections 4.12.2 and 4.12.3 where we have discussed the general case. It is, 
therefore, useful to reformulate some of the results above in matrix notation. 

We write model (8.5) as 

y = 0 fi + X T r + X 3 0^e\ (8.22) 

where y = (yii, 2 / 12 , •. • ,ytr)’ is a tr x 1 column vector of the observations, 3 is a 
tr x 1 column vector of unity elements, X r is a tr x t matrix of known constants (zero 
or one), r — (ri.r 2 ,... ，丁 t) f is a t x 1 vector of treatment effects, = (a；n - 
x,., Xi 2 — 无 is a tr x 1 vector of the covariates (expressed as deviation 
from the mean), and e* = (e^.e ^,, e* r ) / is a tr x 1 vector of errors. The NE are 
then of the form 


'3 1 0 O' X r O' X 3 ' 

■ a ■ 
" 


「 3'y] 


X；3 X ； X r X;Xj 

f 

= 

X；y 

, (8.23) 

X' g 3 X' 3 X T X' 3 X 3 

Q 


L x «yJ 
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which can be rewritten as 


tr 

r^t 

0 " 


"A" 


' y.. 


rOt 

rlt 

X% 


f 


X；y 

. (8.24) 

0 

x^x T 



A 


L x ^y. 



With 3^ f = 0 it follows from (8.23) and (8.24) that 

A = y.. 

f^^K(y-Oy..-X30) 

p=(X' 3 X 3 )- 1 ^(y-3y..-X T r). 

Evaluating these expressions leads, of course, to the estimators (8.12), (8.13)，and 
(8.15). 


8.3.5 ANOVA Table 

We now turn to the ANOVA table. Since this is a nonorthogonal ANOVA, we use the 
notation established in Chapter 4 to indicate how the various sums of squares have been 
obtained. Specifically, we obtain the treatment SS as 


SS(X r \0,X f3 ) = SS(3,X r ,X3) - SS(0,X i3 ) 


(8.25) 


with 


SS (丄 X T ， X. 3 ) = ^.. + f'X f T y + /3X^y 
= Ay - + 千飢 + 冷 

i ij 

=try?. +r'^(y i . - y..f 


Exy 

Exx 


- - y-) - r 一 元 .. ） ( 仄 . 一 5..) 


try?, -i- Tyy + 


Exx 


(8.26) 


using the notation of Table 8.1. To obtain SS(3, X/ 3 ) we use the model 


y = 3fi + X^/3 + e*. 


which leads to the NE 


'D'3 




’ y“ - 

xp 

X 而 . 

A 


feyj 
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and the familiar solutions 


A = V- 

P = (X^r^y 

-y-) 

— Jj _ 

- X..) 2 

n 


— Sxy 
^xx 

using again the notation from Table 8.1. It follows then that 
SS(3, X 。） = _..+ px r p y = try?. + 

^XX 


Substituting (8.26) and (8,27) into (8.25)，we obtain 

SS(X r |3,X / 3) = Tyy - + 

^XX ^xx 

We proceed in a similar fashion to obtain the regression SS as 

SS(X, 5 |3,X r ) = SS(J,X r? X^) - SS{D,X r ). 
In order to obtain SS(3, X T ), we use the model 

y = + X T r + e *， 


which leads to the NE 


fo fx T ' 

A 


• y" ■ 

X；3 X ； X r _ 

f. 

= 

x；y 


and the solutions 

= V- 

~r = (X ； X T ) _ 1 [X；y - KD'fi] 

= -X ； (y-5y..) 
r 

Vi. - 1 S.. 

V2. - y- 
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(8.28) 


(8.29) 


Vt. — V..' 
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Hence, 


SS(3,X r ) = jly.. + f'x' T y 

=try?. + i(y - 3y..)'^ T X! T y 

=try?. + r^2{y l . - y..) 2 

i 

=try?. + T yy . 


Substituting (8.26) and (8.30) into (8.29)，we obtain 

■^xx 

Finally, the error sum of squares is obtained as 


SS(I|5 : X r ,X fl ) =SS(Total) unadj . -SS(3 ,X t .X 3 ) 


= Yl y ij ^ tr y ? -~ T yy ~ 

ij 


Exx 


应 . 

E xx ' 


(8.30) 

(8.31) 


(8.32) 


We point out, in passing, the similarity between (8.32) and the form of the error sum 
of squares for a simple linear regression model, such as (8.3), which is our notation is 
given by Syy - S 2 xy /S xx . 

The complete AN OVA table is given in Table 8.2. 

It follows from E(MS) in Table 8.2 that an estimator for is given by 




MS(I|3, X T , X 3 ) 


J VV ~ "r~ 
^xx 


/ [t(r-1)-1], 


(8.33) 


The form of (8.33) shows explicitly that, unless there is no or only a weak linear re¬ 
lationship between the observation y and the covariate x, the variance estimator (jg* 
is smaller than the comparable variance estimator of (6.29), that is, for the CRD 
without supplemental information (see also Section 8.5.1). 

The form of the E(MS) in Table 8.2 suggests immediately to test Hq: 丁 : =T 2 — 
.•. = B = 0 by the F-test, as an approximation to the randomization test (Robinson, 
1973) for large r, 

MS(X T | 丄 x, 3 ) 

- MSCtllX^Xy 

and Hq ： .5 = 0 by 

' MS(X 桌 X T ) 

— MS(l\0.X T .X 3 ) 
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using the appropriate d.f. as indicated in Table 8.2. For a numerical example see 
Section 8.8. 


8.4 TREATMENT COMPARISONS 


8.4.1 Preplanned Comparisons 

As discussed in Chapter 7, the overall null hypothesis n = 乃 = … =rt = 0 is often 
less important and less informative than specific hypotheses of the form EiCiTi = 0 
with Eid = 0. We have shown in Section 8.3 that 


var 


(? 


Ci 亇 i 


} (13 C ^ ) 2 

r Exx 



[see (8.21)] 

Using (8.21) and (8.33) we can then perform the usual tests on single d.f. contrasts 
either in the context of the ANOVA by forming 


SS f CjTi^ 


(E 


Ci 亍 i . 




and 


or, equivalently, by using 


F ' 


%(E 


Ci 丁 i , 


r 十 ~ 


(8.34) 


(8.35) 


(8.36) 


(We should mention here that for a complete set of orthogonal contrasts, Cr, satisfy¬ 
ing (7.7) and (7.8), we no longer have the result (7.6). The reason for this is that even 
though the contrasts are orthogonal, their estimators are not, that is, they are correlated 
as follows from (8.20).) In particular, we may be interested in testing hypotheses about 
simple treatment differences — and, if appropriate, use the multiple comparison 
procedures discussed in Chapter 7. Since we no longer can compare treatment means 
but rather have to use LS means, which not only may have different variances but are 
also correlated, this leads to certain complications and calls for modifications of the 
procedures discussed earlier. 
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8.4.2 Multiple Comparison Procedures 

Suppose we want to use Duncan’s Multiple Range Test (see Section 7.5.5). We may 
begin by arranging the LS means in increasing order 

A + %i，A + %]， …， A + ^i. 

Following the established procedure we then compare jl + f[ij versus [l -j- fj t j, that is, 
f[ij versus f[ t ], by considering 

T[ t ] - f tl] = y [t] . - y[i], - p{x [t] . - X[i].), (8.37) 


where X[ t y is the mean of the rc-variates corresponding to y^].，and so forth, and com¬ 
paring it with 


Qoct 


， t，v 


2 (x [t] , -x {1] y 


一 + 

V 






(8.38) 


where i/ = t(r 一 1) — 1 are the d.f. for error (see Kramer, 1957; Miller, 1981). If 
(8.37) is larger than (8.38)，then the effects of the corresponding treatments are judged 
to be different; if (8.37) is smaller than (8.38) then the treatments are judged to be not 
different from each other, and ordinarily the comparison procedure would stop (see 
Chapter 7). Such a property no longer holds in the present situation where the variance 
of the difference of LS means is not constant but does depend on the x-means for the 
treatments involved [see (8.38)]. For this reason treatments are compared with different 
precisions, more precise if the z-means are close together, less precise if the a>means 
are far apart. Hence the nonsignificance of f ⑷一 f ⑴ may be due to the fact that, just 
by chance, X[ t ], - 无 [i;. is rather large. It can very well happen then that two other 
LS means, say LSM(r ⑷) and LSM(r[q)，within the remaining set are significantly 
different from each other simply because their x-means are close together, that is, the 
quantity 


Qa 




2 + (%]• _%'].) 21 
r Exx 




(with k < t) is not only smaller than (8.38) but also smaller than - f^/]. The 
important point of this discussion is that even after a nonsignificant result for LS means 
of range l, we may have to continue comparing LS means of range less than 1. This 
may indeed lead to making all possible t(t ~ 1)/2 comparisons. 

The procedure just described may be rather tedious. An alternative, much sim¬ 
pler but possibly less satisfactory, method is to make all comparisons by using a con¬ 
stant variance, namely the average variance of all simple treatment comparisons. We 
find that 


av. var(fi 



(8.39) 



252 


CHAPTER 8. USE OF SUPPLEMENTARY INFORMATION 


Since we assume that the aj-values are not affected by the treatments, we would have 
that, on average. 


T xx 

t-^1 


E'xx 


(8.40) 


that is, the treatment mean square and the error mean square for the covariate are equal. 
If we use (8.40) in (8.39), the expression for the average variance reduces to 


av. var(fi - t^) 


f(r - 1) 


(8.41) 


independent of the actually observed : r-values. (A slightly different result was obtained 
by Cox (1957) by considering the covariates as normally distributed random variables). 
We may then use any of the multiple comparison procedures with (8.40) or (8.41). By 
doing so we must, however, realize that this procedure favors certain comparisons, 
those that have a variance larger than (8.41), over others, those that have a variance 
less than (8.41). Hence, care should be used in interpreting the results. Obviously, this 
procedure works quite well if the : r-means are not too different from each other. This 
is an ideal situation, one that has also been advocated by Cox (1982) for purposes of 
randomization analysis. 

Arguments similar to those above can be made for other multiple comparison pro¬ 
cedures, extending, for example, the Tukey procedure to the Tukey-Kramer procedure 
based on the result given by Kramer (1957). An example will be given in Section 8.8. 
For more details the reader is referred to Hochberg and Tamhane (1987). 


8.5 VIOLATION OF ASSUMPTIONS 

During our discussion so far we have made a number of assumptions, some implicit 
and some explicit. These assumptions can be summarized as follows: 

(i) There exists a linear relationship between the covariate x and the observation y. 

(ii) The relationship between y and x is the same for each treatment. 

(iii) The covariates are not affected by the treatments. 

(iy) The observations come from a normal distribution. 

In this section we shall consider these assumptions, how they may be checked in a 
given situation, and point to the implication of the violation of these assumptions. 

8.5.1 Linear Relationship between x and j 

Suppose two random variables x and y have a bivariate normal distribution with means 
\x x ^\i y , variances , and covariance pa x a y . Then the conditional distribution of 
y, given x, is normal with mean [L y + p(x — " x ) and variance crg(l — p 2 ), where 
0 = pay!a x . The variance cr^(l — p 2 ) is the variance of y about the regression line 
y = + p(^x — fi x ). In our situation we have t regression lines of the form y = 
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fj, y i + (3(x — = 1 ， 2,… ，艺 ) with fjLyi = /i + Ti, and the variance of y about each 

line is 4(1 — p 2 ) (see Figure 8.2)，where in our notation = a\. This implies that 
the increase in precision using the covariate x is given by (1 — p 2 ), or if we account 


for the fact that 3 has to be estimated and hence leads to an increase in the variance of 
comparisons, the increase in precision is on the average given by 


J=(l 


-P 2 ) 


t(r -1)-1 
亡 (r-1) 一 2 


(8.42) 


(Cochran, 1957; Cox and McCullagh, 1982). It follows then that an increase in preci¬ 
sion will be realized only if p is of a reasonable magnitude. If, for example, p 二 0, 
that is, there does not exist a linear relationship between x and y, then / > 1 and, 
in fact, information has been lost rather than gained. In order to get some idea how 
large p would have to be for the analysis of covariance to be worthwhile we consider 
(8.42) in connection with (8.41)，that is, we compare the average variance of treatment 
comparisons using the covariate 


av. var(fi - f^)= - 






t(r - 1) - 1 2 
t{r-r^-2 e 


with the average variance without the covariate 


av. var(fi - 千 i')= 



(8.43) 


(8.44) 


For (8.43) to be smaller than (8.44) we require 


t(r — 1) t(r — 1) — 2 
t(r — 1) + 1 t(r -1)-1 


In Table 8.3 we give minimal values of p for selected values of t and r. These should be 
viewed as rough guidelines only, especially for small values of t and/or r，since there 
may be substantial variation in precision between different randomization patterns and 
between different comparisons within one randomization pattern (Cox, 1957). 

The general conclusion from Table 8.3 is that for \p\ < .3, the use of covariates is 
of no real value. Substantial gains will be realized, however, if \p\ is large. 


8.5.2 Common Slope 

Implicit in model (8.5) and Figure 8.2 is the assumption that the linear relationship be¬ 
tween X and y is the same for all treatments, that is, the t regression lines are parallel. 
This is sometimes considered to be a serious and questionable assumption. This may 
be true for some situations in which the analysis of covariance is used, that is, observa¬ 
tional studies, but should generally not be a problem in a CRD if proper randomization 
has taken place. The assumption is, however, checked easily using the procedure de¬ 
scribed below. 

Consider the “full” model 

Vij U 0i{xij - x„) + error, (8.45) 
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Table 8.3 Minimal Values for Correlation between 
Observations and Covariates in CRD 


t 

r 

\p\ 

2 

5 

.49 


10 

.33 


15 

.27 


20 

.23 

3 

5 

.40 


10 

.27 


15 

.22 

4 

5 

.35 


10 

.23 


15 

.19 

5 

5 

.35 


10 

.21 

6 

5 

.29 


10 

.19 

7 

5 

.27 


where 叫 =// + 卩 (i = 1 ， 2, ..., j = 1,2, •.., r). We wish to test the hypothesis 
HqH 2 = … == ,3, say. 

We do this by fitting the model (8.45) and the “reduced” model (that is, assuming Hq 
is true) 

Vij = Mi + 0{xij - x.) + error (8.46) 


and obtain the sums of squares for both models, say SSf and SS 尺， respectively. We 
then test Hq by considering the F-statistic 


F 


(SS F -SS R )/(t-l) 

A( r - 2 ) 


(8.47) 


with t ~ 1 and t(r — 2) d.f. The NE for model (8.45) are 


rfk+ r(xi. - x.)0i = y L (i = 1 ， 2, … ， t) 


r(xi. - x,.)fLi + y^(xjj ~x.) 2 Pi = y^yij(Xij - x.) (i = 1,2, 
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It then follows that 


Ai = Vi. - - 龙 .•) 

A _ ~ ~~ Vi.) _ Sixy 

= ~^ 三 say 


ss F = ^2 屯访 .+ 戌 (m - 元 “、yij 

i ij 

= ^E^ + E^ 

/ ; ^IXX 

with 2t d.f. Similarly, for model (8.46) the NE are 

rp，i + r(x L - x..)0 = yi. (i = 1 ， 2,… ， it) 

r^2(xi, - x,.)jli + -x.) 2 p = ^2yij(Xij - x..). 

i ij ij 

It follows then [see also (8.16) and (8.15)] that 

Ai = Vi. - - X,) 

〉: Sixy 

6 = 」- = ^ 

y2 S ixx Exx 


(we mention here in passing that it follows from (8.48) and (8.50) that the estimate of 
/? is obtained by weighted pooling of the estimates of the individual 戌 )， and [see also 
(8.26)] • ^ ^ 


SS H = 






with t + 1 d.f. The test statistic (8.47) then takes on the form 


1 ^ixy 

^1 ^ixx 




^2^ ix 


_L_ 

1 q _ Si Xy 

~ 2^ ^y- 2^ s~ 

L i i J 
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with Ui = — (t 4* 1) = t — 1 and = tr — 2t = t(r — 2) as in (8.47). If 

F > Fi- a ,t-i,t(r-2) for a suitably chosen a, then we reject Ho and conclude that 
the slopes are not all the same. Since this is considered to be a preliminary test we 
may choose a = .25 rather than the customary a = .05 (see Bancroft, 1964). For a 
numerical example of this procedure see Section 8.8. 

If Hq is rejected it might be useful to investigate the data more closely, for example 
by plotting or by a formal test to see if the nonparallelism is due to perhaps one treat¬ 
ment. One then may delete that treatment and proceed with the analysis of the other 
treatments in the usual fashion. If no such simple explanation is plausible it is diffi¬ 
cult to prescribe what to do. In any case, model (8.5) is no longer appropriate, rather 
model (8.45) should then be used. In that case, however, treatment comparisons de¬ 
pend on the x-value at which they are compared and that may be rather unsatisfactory 
and misleading. 


8.5.3 Covariates Affected by Treatments 

To understand intuitively the problem that arises when the covariates are “affected” by 
the treatments, consider Figure 8.3. 

In this case low y-values are associated with low a:-values for (that is, treatment 
1) and high y-values are associated with high x-values for (that is, treatment 2 )， 
the two treatments we may want to compare. As an example，suppose the treatments 
are varieties of potatoes and we want to compare the yield of these varieties using as 
a covariate the size of the seed potatoes. It so happens that variety 1 has small seed 
potatoes and variety 2 has large seed potatoes. If we were to apply the analysis of 
covariance procedure we would compare the varieties at seed potato size a: = a 
value which may not be achieved by either variety. Hence, this procedure is obviously 
of no value. Similar arguments apply to situations where the covariates are affected 
by the treatments in other ways. For an interesting discussion the reader is referred to 
Smith (1957). 

We mentioned earlier that if the covariates are observed before the treatments are 
assigned they are certainly not affected by the treatments. But even in that case a 
situation as described in Figure 8.3 could arise for two reasons: 

(i) due to a particular outcome of the randomization process and (ii) due to a lack of 
randomization. In situation (i) one should throw out the randomization pattern and 
repeat the randomization process; in situation (ii) one should expose the motives of the 
investigator. In practice, of course, one may not be able to distinguish between (i) and 

(ii) . A method to protect oneself against this situation would be to subject the covariates 

x to an ANOVA for a CRD and consider F — T xx t{r — \)/[E xx {t — 1)] and if F is 
“large，” say larger than with q = .25, assume that the covariates are 

“affected” by the treatments and proceed accordingly. 
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Figure 8.3 Covariates affected by treatments. 


8.5.4 Normality Assumption 

Even though we have not invoked the assumption of normality explicitly (we have 
instead argued that the tests based on normality are an approximation to randomization 
tests), we have used the method of least squares to estimate parameters of model (8.5). 
It is well known that the LS method is optimal only if the observations are normally 
distributed. Hence the question arises: What should one do if the observations are not 
normally distributed. The obvious answer is to replace the LS method by any of the 
available “robust” methods, e.g. ， M-estimation (Huber, 1964). A limited study along 
these lines was done by Birch and Myers (1982) for heavy-tailed error distributions. 
We quote from their conclusions: 

The least squares procedures, although lacking efficiency in estimation of the 
parameters, show strong affinity toward the normal size in tests concerning param¬ 
eter values. The t-like tests based on M-estimators can be studentized to follow the 
t-distribution. Due to the similarity of the test results for LS and M-estimators it is 
suggested that both procedures be used together to provide a basis of comparison 
and diagnostic examination of the data. If parameter estimates differ significantly 
then outliers in the data should be strongly suspected and may be examined. Tests 
on parameters can be performed based on least squares and/or M-estimates using 
the 亡 -like or F-like procedures. 

We add that the tests referred to are those given in Section 8.3.5 and that given in 
(8.52). 





258 


CHAPTER 8. USE OF SUPPLEMENTARY INFORMATION 


Along similar lines Hocking (1982) suggested to use regression diagnostics such as 
the so-called hat matrix and residuals to detect deviations from normality through e.g., 
high-leverage observations. We shall not pursue these arguments here but rather refer 
the reader to some of the relevant literature in this area, for example, Belsley, Kuh and 
Welsch (1980), Myers (1990). For nonparametric procedures we refer to Conover and 
Iman(1982). 

8.6 ANALYSIS OF COVARIANCE WITH 
SUBSAMPLING 

As an extension of the analysis of covariance procedure presented so far, we now dis¬ 
cuss briefly the case involving subsampling (see Section 6.8). We can think of two 
situations: 

(i) The covariate is observable only for the EU; that is, Xij. 

(ii) The covariate is observed for each OU, that is, 

As an example of (i), consider a study to compare different drugs for their effec¬ 
tiveness in reducing blood pressure. Each patient (from a specified population) is given 
one of the drugs at random. To account for some variability among the patients, the 
blood pressure reading at the beginning of the trial is used as a covariate. At the end 
of the study duplicate blood pressure readings are obtained for each patient. An ex¬ 
ample for (ii) is the previously considered air pollution study (see Table 2.5) where 
each growth chamber represents the EU and the initial height of each plant (OU) in 
each chamber is used as a covariate. Remembering that the analysis of covariance is 
used as a device for reducing the experimental error it is only proper to treat both sit¬ 
uations in the same way. For (i) we use the covariate Xij as observed, and for (ii) we 
use as the covariate the average of the supplementary observations for each EU, that is, 

^ij = = ( 1 / 

As an extension of (8.5), the model for such data can then be written as 

Vijk = M + n + P(xij -x..) + +rj ijk (8.53) 

where i = 1, 2,... j = 1,2....,r 7 ; k = 1 ， 2, ... ， n and all the terms are as de¬ 
fined earlier (see Sections 8.3 and 6.8). Model (8.53) can be rewritten, for purposes of 
analysis, in terms of the average observation for the jth replication of treatment i as 

Vij. = " + n + 0{xij -x..) + (8.54) 

where e*j — £*j + fjij.. The form of the model suggests that the basic analysis can 
be carried out as described in Section 8.3, substituting for yij.al** for cr^*, and 
r / for r. More precisely, we obtain the entries in Table 8.4 using the as the 
“observations.” For example, 

Tyy 二 r, 〉 ： ( 没 i.. - V … ) 2 




8.7. CASE OF SEVERAL COVARIATES 


259 


Table 8.4 ANOYA for Model (8.53) 


Source 

d.f. 

SS 


X t | 3,X+X s . 

t-l 

T* - 
yy 

c*2 

°xy ^xy 

u xx ^xx 

X, 3 |3， X t ，X s * 

1 

^xy 

E* 

■^XX 


X e .\3 ， X”X 0 

t(r' -1)-1 

rp* 

匕 yv ~ 

jp*2 

^xy 

" 'E r 

I\3,X t .X 0 ,X s ， 

tr^n — 1) 

o* 

yy 


Total 

tr'n - 1 




For purposes of the ANOVA table, however, we need to reconvert everything to a per- 
observation basis by simply defining 

T；y = nTyy, = nE yy , = SS (OE) 

(see Table 6.8)，and so on. The resulting ANOVA table is then as given in Table 8.4 
using obvious notation. It should be clear, from our earlier discussion, how Table 8.4 
can be used. For example, to test Hq ： n = r 2 = = rt = 0 we use 

MS(X t 1J ; X /3 ,X £ Q 

""MS(X £ ， |3 ; X t ,X /3 ) 

with t — 1 and t(r f -1)-1 d.f. Furthermore, the sampling and experimental error 
variance components are estimated as 

斤卜 MSC^XnXAXy) 

and 

= MS(X g . IX r; X^) - MS(I|3, X r; X,^, X £ ，) 

^ _ n . 

8.7 CASE OF SEVERAL COVARIATES 

In our discussion of the analysis of covariance technique so far we have considered 
the simplest but most important situation, namely that of one covariate and a linear 
relationship between the covariate x and the observation y. There may, however, be 
situations where the relationship between x and y is of a polynomial form or it may 
be useful to consider several covariates X\,X 2 ^ • > which have a linear or polynomial 
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relationship with y. We have described and dealt with the general model involving both 
classificatory and regression parts in Section 4.14. Below we give a slightly different 
derivation following Cox and McCullagh (1982) (see also Scheffe, 1959). 


8.7.1 General Case 

Using matrix notation we write the general analysis of covariance model for the N x 1 
vector of observations y as 

y = Xfi + Z 7 + e*, (8.55) 

where X/i represents the classificatory part (treatments in our case) and Z 7 represents 
the regression part (the covariates in our case), X and Z are matrices of known con¬ 
stants of dimensions N x d x and N x d z ， respectively, /x and 7 are d x x 1 and d z x 1 
vectors, respectively, of unknown parameters, and e* is a iV x 1 vector of errors with 
E(e*) = 0, and var(e*) = If no covariates are included or, alternatively, if 

7 = 0, then model (8.55) reduces to 


y = X/x + e. (8.56) 

We shall refer to (8.56) also as the design model. We know (see Chapter 4) that for 
(8.56) an orthogonal decomposition of y is given by 

y = + [I- X(X , X)- 1 X , ]y - P x y 丄 R x y ， (8.57) 

where 

P x = X(X , X)~ 1 X , and R x = [I - X(X , X)~ 1 X , j = I - P x 


are N x N idempotent matrices (we assume here that the parameterization in (8.55) 
and (8.56) is such that rank(X) = d x ). In (8.57) Rxy is the vector of residuals and 
y ; Rxy is the residual sum of squares. We now rewrite (8.55) as 


y = X " ⑼ +RxZ7 + e* 

=X[ M(0) - (X , X)- 1 X , Z 7 ] + Z 7 + e* 

so that 

M M ⑼- (x'xrWz，. 

Using (8.58) the NE are obtained as 


XX 

XR x Z 

A(o) 


'XV ■ 

ZR x X 

ZR X Z 

. 7 . 


ZR x y 


which reduces to 


X，X 

0 ' 

A(o) 


'xv ' 

0 

ZR X Z 

. 7 . 


Z'R x y 


From (8.60) we obtain immediately 

= (x'xfxv ， 


(8.58) 

(8.59) 


(8.60) 


(8.61) 
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that is, the estimator for /x under the design model, 

7 = (Z , R x Z)- 1 Z , R x y 


(8.62) 


and from (8.59) 

A = A ⑼ - (x’xrtx'Zr (8.63) 

It also follows from the special form of (8.60) that 

var(A (0) ) = (Mj-V, 2 . 

and 

var( 7 ) = (Z'RxZ)- 1 ^. (8.64) 

and, since /i ⑼ and 7 are uncorrelated, 

var(/i) = [(X’X )- 1 + (Z^xZ)- 1 ]^,- (B.65) 

We comment briefly on the form of (8.62). The elements of Z’RxZ are the error sums 
of squares (diagonal elements) and error sums of products (off-diagonal elements) for 
the design model (8.56) when the columns of Z are used as the “observation” vectors. 
Similarly, the elements of the vector Z^Rxy are the corresponding error sums of prod¬ 
ucts using successively the columns of Z with the observation vector y. This presents 
an easy way of obtaining 7 and hence ft as we shall illustrate in Section 8.7.2. 

The error sum of squares, SS(I|X. Z), is obtained in the usual way as 

SS(I|X,Z)=yV- ft{o)^y - fZRxy. (8.66) 

It is instructive to write ( 8 . 66 ) as 

SS(I|X,Z) = SS(I|X)- 7 / Z / R x y 


which shows that the error sum of squares for model (8.55) is smaller than the error 
sum of squares for model (8.56), and the reduction is given by 7 ’Z’Rxy. From ( 8 . 66 ) 
we then obtain 


& 2 e . = MS(I|X.Z) 

= SS(l\X,Z)/(N -d x -d z ). 


Finally, to test any hypothesis about p or a subvector of 



say Ho ： = /i*, we fit the model 

y = x 



(8.67) 
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say, and obtain SS(I|X*, Z) in the same way as we obtained (8.66). Suppose Ho is of 
rank d. We then form the F-statistic 


[SS(I|X-,Z)-SS(I[X ; Z)]/d 
上 — MS(I|X^Z) 


( 8 . 68 ) 


or alternatively, 

[SS(X ， Z)-SS(X* ， Z)]/d 
- MS(I|X ， Z) 

with d and N — d x - d z d.f. For /xj = 0 this procedure is derived explicitly in 
Section 4.14.2. 


8.7.2 Two Covariates 

We shall illustrate the procedure described above in terms of a simple example in the 
context of the CRD. Suppose we have t treatments, each replicated r times, and two 
covariates x and : (for polynomial regression with one covariate x and a quadratic 
relationship, we can use this technique by taking the second “covariate” to be z = x 2 ). 
Then, in model (8.55) we have 

M = (Ml ， "2, • . . ， Mt)， 

where for the CRD we have "i = "■ + Ti(i = 1 ， 2, • • • ， 亡)， 



where 3 r is a r x 1 column vector of unity elements and X contains t such vectors, 


7’ = (7i ， 72) 

'^ii z u 
x 12 Z 12 

Z = ::， 

where x*j = Xij - - 乏 ..，and and are the covariates for the 

jth replication of treatment i. In a practical setting the treatments may be different 
advertising strategies for a book of general interest, the EUs are comparable book stores 
in different cities, x may be the sales volume of a bookstore in the previous month, z 
may be the price of the book established prior to the advertising campaign (there being 
slight differences in price due to local conditions), and y being the sales volume of this 
book during a specified period. 

We then find from (8.61) 


A(o) = (yi. ： y2.,---,Vt.y 
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and from (8.62) 



where E xz , E zz , E zy are defined in an obvious way as an extension of the terms in 
Table 8.1 and as described above as error sums of squares and error sums of products 
for the CRD. Further, from (8.63) we obtain the ith component of /x as 

Ai = Vi. - - 12Z* 

= Vi. - - X..) - 72(2i. - Z..). 


Also, from ( 8 . 66 ) we find 

SS(Error) = SS(I|X,Z) 

= Eyy - E X y^)\ — E Z y^2 

and 

^e. = {Eyy - E xy ^i - E zy - f2 )/[t(r - 1) - 2]. (8.69) 

Finally, to test the equality of treatment effects H 0 : fii = = .. • = = "，model 

( 8 . 68 ) takes on the form 

y = 3 以 + Z 7 + e*. 

The NE then yield the estimators /x, 71 ? 72 for /x, 71 , 72 , respectively, as 

A = y- 

s xx S xz \(^\^ 

s zx s zz ) \ 12 )~ 

and hence 

SS(I|3, Z) = Syy - S X y^\ - S Z y^2 

so that ( 8 . 68 ) becomes 

F — (Tyy - Sxy'Ji — Szy^ + Exy^fi + E zy j 2 ) / (t — 1 ) 

(Eyy - E X y^l - E Z y% 、 l\t、T - 1 )— 2 ] 

Tests for 71 and 72 can, of course, be derived in a similar fashion. 

The general case with m covariates, x\. X 2 ,.. •, x m say, is discussed in Section 4.14.4. 
It is shown there, as is somewhat intuitive from the case m = 2 above, that the arith¬ 
metic can be represented by the ANOVA of y, X i? ..., X m and the corresponding 
sums of products. 

Many of the problems discussed in Section 8.5 can arise in the multiple covariate 
situation as well and extra care must be taken to assure validity of the basic assump¬ 
tions. For example, problems of collinearity may arise and appropriate diagnostics 
and/or different estimators for the regression coefficients in 7 may have to be used as 
described for example by Myers (1990). The problem may become rather complicated 
and in the end not worth the effort as there may be only marginal reduction of error as 
the number of covariates increases. 
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8.8 EXAMPLES USING SAS® 


Even though the analysis of covariance in its simplest form is easy to perform, com¬ 
puter programs have, nevertheless, led to a much wider and more common use of this 
method of reducing experimental error. In the following we shall give some examples 
and illustrate the use of SAS Proc GLM. 

Example 8.1: Consider the experimental situation described in Exercise 8.3 with 
the data given also in Table 8.5a. The input statements for SAS PROC GLM are given 
in Table 8.5a. There we have included several options such as “inverse” ， “solution”, 
and “e” the reason for which we shall explain in the comments for the output, given in 
Table 8,5b: 

(i) The Type III SS in the ANOVA show that there are differences among the treat¬ 
ments (P < .0001) and that the regression coefficient is different from zero 
(P = 0.0005). 

(ii) The solution vector (which is produced because of the “solution” option, required 
for classificatory models) gives /? = .773 with standard error se(8) = 0.16. 

(iii) The se(8) can also be obtained from the X'X Generalized Inverse as 

se(§) = (0.03358 x 0.7656) 1/2 

where .7656 = from the ANOVA table. 

(iv) The General Form of Estimable Functions (obtained because of the option “e” 
in the model statement) can be used to interpret the solutions for the treatment 
effects. For example, for treatment 1 we have 

6.2245 = 

which is obtained by putting L2 = 1 and all other Li = 0, with 

se(Ti^rs) = 0.6021. 

(v) The generalized inverse can also be used to obtain, for example, 

= [(.4735 + .4113 — 2 x .1711) x ,7656] 1/2 . 


(vi) The “e” option for LSmeans shows us how to obtain the LSmeans. To do so, 
however, we need to mention that instead of model (8.5) SAS uses the model 
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Table 8.5 Basic Analysis of Covariance 


a) Input statements: 

data ancova; 
input trt x y 
datalines; 

1 1 57.0 1 2 55.0 1 3 62.1 1 4 74.5 1 5 86.7 1 6 42.0 

2 1 64.8 2 2 66.6 2 3 69.5 2 4 61.1 25 91.82651.8 

3 1 70.7 3 2 59.4 3 3 64.5 3 4 74.0 3 5 78.5 3 6 55.8 

4 1 68.3 4 2 67.1 4 3 69.1 4 4 72.7 4 5 90.6 4 6 44.3 

5 1 76.0 5 2 74.5 5 3 76.5 5 4 86.6 5 5 94.7 5 6 43.2 

run; 

proc print data=ancova; 

title ’DATA FOR CRD WITH SUPPLEMENTARY INFORMATION，; 
run; 

proc glm data=ancova; 
class trt; 

model y=trt x/ inverse solution e; 
means trt; 

lsmeans trt/stderr e; 

title ’BASIC ANALYSIS OF COVARIANCE ，； 
run; 

b) Output: 


DATA FOR CRD WITH SUPPLEMENTARY INFORMATION 
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Table 8.5 {Continued) 


BASIC ANALYSIS OF COVARIANCE 
The GLM Procedure 
Class Level Information 
Class Levels Values 

trt 3 123 


Number of Observations Read 15 

Number of Observations Used 15 


X Generalized Inverse (g2) 



Intercept 

trt 1 

trt 2 

Intercept 

0.8739556749 

-0.422646071 

-0.11274681 

trt 1 

-0.422646071 

0.4735527199 

0.1711752854 

trt 2 

-0.11274681 

0.1711752854 

0.4112961719 

trt 3 

0 

0 

0 

X 

-0.150436535 

0.0496977837 

-0.019476158 

Y 

2.7154466085 

6.2245399597 

2.9314640698 


X r X Generalized 

Inverse (g2) 



trt 3 

X 

y 

Intercept 

0 

-0.150436535 

2.7154466C85 

trt 1 

0 

0.0496977837 

6.2245399597 

trt 2 

0 

-0.019476158 

2.9314640698 

trt 3 

0 

0 

0 

X 

0 

C.0335795836 

0.7733378106 

y 

0 

0.7733378106 

8.4220302216 


General Form 

of 

Estimable Functions 

Effect 


Coefficients 

Intercept 


LI 

trt 

1 

L2 

trt 

2 

L3 

trt 

3 

L1-L2-L3 

X 


L5 
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Table 8.5 {Continued) 


BASIC ANALYSIS OF COVARIANCE 
The GLM Procedure 


Dependent Variable : y 





Sum of 





Source 


DF 

Squares 

Mean Square 

F 

Value 

Pr > F 

Model 


3 

84.67796978 

28.22598993 


36.87 

<•0001 

Error 


11 

8.42203022 

0.76563911 




Corrected 

Total 

14 

93.10000000 






R-Square 

Coeff Var Root 

MSE y 

Mean 



0.909538 

9. 

722312 0.875008 9.000000 


Source 


DF 

Type I SS 

Mean Square 

F 

Value 

Pr > F 

trt 


2 

66.86800000 

33.43400000 


43.67 

<.0001 

X 


1 

17.80996978 

17.80996978 


23.26 

0.0005 

Source 


DF 

Type III SS 

Mean Square 

F 

Value 

Pr > F 

trt 


2 

83.14658219 

41.57329109 


54.30 

<•0001 

X 


1 

17.80996978 

17.80996978 


23.26 

0.0005 


Parameter 


Estimate 


Standard 

Error 

t Value 

Pr > 11 j 

Intercept 


2.715446608 

B 

0.81800651 

3.32 

0.0068 

trt 

1 

6.224539960 

B 

0.60213826 

10.34 

<.0001 

trt 

2 

2.931464070 

B 

0.56116347 

5.22 

0.0G03 

trt 

X 

3 

0.000000000 

0.773337811 

B 

0.16034289 

4.82 

0.0005 


NOTE : The X'X matrix has been found to be singular, and a generalized inverse 
was used to solve the normal equations. Terms whose estimates are 
followed by the letter , are not uniquely estimable. 
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Table 8.5 (Continued) 

The GLM Procedure 


Level of - y - - x - 

trt N Mean Std Dev Mean Std Dev 

1 5 11.2600000 1.32400906 3.00000000 1.2C415946 

2 5 9.5600000 1.92691463 5.06000000 1.75157072 

3 5 6.1800000 1.04498804 4.48000000 1.71084774 


Least Squares Means 

Coefficients for trt Least Square Means 

trt Level 

Effect 1 2 

Intercept 
trt 1 

zrt 2 

trt 3 



Standard 

trt y LSMEAN Error Pr > |t : 

1 12.1725386 0.4346564 <-0001 

2 S.8794627 0.4159778 <.C001 

3 5.9479987 C.394261C <.0001 


Then the coefficients for trt Least Square Means tells us that, for example, 


LSmean(trt 1) = " + Ti +/? . 4.18 

= 2.7154 + 6.2245 4 - .7733 - 4.18 
=12.1723 ， 

which is, apart from rounding error, equal to the LSmean (trt 1) given in the SAS 
output. Here " and are part of the solutions to the NE obtained by SAS (using 
t 3 = 0), and 4.18 = x,. 

(vii) The se[LS mean (trt 1)] can be obtained by using the generalized inverse, 5 ^， 
and the coefficients for the LS mean as 

se [LSmean(trt) 1] = [(.8740 + .4736 + (4.18) 2 x 
.0336 — 2 x .4226 一 2 x .1504 
+ 2 x 4.18 x .0497) x .7656] 1/2 = .435 

(viii) Just for comparison we give y-means in addition to the LS means to show that 
LS mean for trt 1 is adjusted upwards, whereas those for trt 2 and 3 are adjusted 
downwards as an illustration of Figure 8.2. 
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Example 8.2: We consider the same data as in Example 8.1. In addition to the 
analysis of covariance we now consider post-hoc comparisons in the form of orthogo¬ 
nal contrast and illustrate the Tukey-Kramer procedure for multiple comparisons (see 
Section 8,4.2). The input statements are given in Table 8.6a and the output in Table 
8.6b. 

We make the following comments: 

(i) In the input statement, in order to perform the Tukey procedure for the LSmeans 
we have to specify “adjust=Tukey” (other multiple comparison procedures are 
available, but not Duncan’s multiple range test). 

(ii) The Tukey-Kramer adjustments apply, of course, only to the multiple compari¬ 
son tests and simultaneous confidence intervals. 

(iii) The low P-values, indicating highly significant differences between the treat¬ 
ments, correspond to lower and upper confidence limits having the same sign for 
each pairwise comparison. 

(iv) Note that the contrast sums of squares do not add up to the treatment sum of 

squares (see Section 7.3). □ 


Example 8.3: Using the data from Example 8.1 we shall demonstrate the use of 
SAS PROC GLM to obtain separate regression coefficients for each treatment and then 
test for equality of slopes. This is done in two steps with the input statements given in 
Table 8.7a. 

For fitting separate regression lines we consider the regressions, technically speak¬ 
ing, to be nested within treatments, expressed as “a:(trt)’’. This is equivalent to model 
(8.45). The procedure for testing for equality of slopes is different than the method de¬ 
scribed in Section 8.5.2. The input statement “; r:r*trt’’ results in “fitting” first a single 
slope, indicated by and then considers deviations from the single slope, indicated 
by “; r*trt”. Testing x^tri = 0 is then equivalent to testing Hq : = 02 = … =0t. 

The results of these two procedures are given in Table 8.7b: 

(i) Using the “solution” option provides (3\ = .976, 02 = .896, 03 = .545 as the 
estimates of the three regression coefficients. 

(ii) The results in (i) may suggest that the regression coefficients may not be equal, 
but the test for “x*trt” is not significant (P = .5548), from which we conclude 
that the assumption of a common slope for the three treatments is reasonable. 

(iii) Looking at the solution vector for x and x*trt we recognize that the single slope 

mentioned above is actually 0s = .5448 and x^trt 1 = /?i — P 3 = .4311; 
x*trt2= P 2 ~ 03 — *3509. □ 



Table 8.6 Analysis of Covariance with Post-Hoc 
Comparisons 


a) Input statements: 


data ancova; 
input trt xy 
datalines; 

1 4.1 12.5 1 2.9 10.3 11.5 9.614.3 12.6 1 2.2 11.3 

2 6.8 11.5 2 2.7 8.6 2 3.8 7.2 2 6.4 11.6 2 5.6 8.9 

3 6.6 6.8 3 2.2 4.8 3 3.5 5.6 3 5.5 7.5 3 4.6 6.2 


run; 

proc glm data=ancova; 
class trt; 
model y=trt x; 

Ismeans trt/stderr pdiff cl adjust=Tukey ; 

contrast ’ 1+2 vs 3’ trt 1 1-2; 

estimate ’1+2 vs 3’ trt 1 1 -2/divisor=2; 

contrast *1 vs 2* trt 1 -1; 

estimate ’1 vs 2’ trt 1 -1; 

title 1 ’ANALYSIS OF COVARIANCE ’； 

title2 WITH POST-HOC COMPARISONS’; 

run; 

b.) Output: 


ANALYSIS OF COVARIANCE 
WITH POST-HOC COMPARISONS 

The GLM Procedure 

Class Level Information 

Class Levels Values 

trt 3 123 


Number of Observations Read 15 

Number of Observations Used 15 


Dependent Variable : y 




Sum of 






Source 

DF 

Squares 

Mean Square 

F 

Value 

Pr 

> F 

Model 

3 

84.67796978 

28.22598933 


36.87 

<. 

00C1 

Error 

11 

8.42203022 

0.76563911 





Corrected Total 

14 

93.10000000 






Source 

DF 

Type I SS 

Mean Square 

F 

Value 

Pr 

> F 

trt 

2 

66.86800000 

33.43400000 


43.67 

<• 

0001 

X 

1 

17.80996978 

17.80996978 


23.26 

0 . 

0005 

Source 

DF 

Type III SS 

Mean Square 

F 

Value 

Pr 

> F 

trt 

2 

83.14658219 

41.57329109 


54.30 

< . 

0001 

X 

1 

17.80996978 

17.80996978 


23.26 

0 . 

0005 
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Table 8.6 {Continued) 


Least Squares Means 

Adjustment for Multiple Comparisons : Tukey-Kramer 


trt y LSMEAN 


Standard 

Error Pr > |tI 


LSMEAN 

Number 


12.1725386 

8.8794627 

5.9479987 


0.4346564 <.0001 
0.4159778 <.0001 
0.3942610 <.0001 


Least Squares Means for effect trt 
Pr > !tI for HO: LSMean(i)=LSMean(j) 

Dependent Variable : y 


3 


1 0.0009 

2 0.0009 

3 <.0001 0.0008 


<.0001 


0.0008 


trt 


y LSMEAN 95% Confidence Limits 


12.172539 

8.879463 

5.947999 


11.215866 

7.963902 

5.080236 


13.129211 

9.795024 

6.815761 


Least Squares Means for Effect trt 


Difference 

Between 

Means 


Simultaneous 95% 
Confidence Limits for 
LSMean(i)-LSMean(j) 


2 

3 

3 


3.293076 

6.224540 

2.931464 


1.552453 

4.598281 

1.415870 


5.033699 

7.850799 

4.447058 


ANALYSIS OF COVARIANCE 
WITH POST-HOC COMPARISONS 

The GLM Procedure 


Dependent Variable : y 


Contrast 

DF 

Contrast SS 

Mean Square 

F Value 

Pr 

1+2 vs 3 

1 

68.31196748 

68.31196748 

89.22 

< . 

1 vs 2 

1 

19.98964494 

19.98964494 

26.11 

0 . 


Standard 

Parameter Estimate Error t Value Pr 


> F 

0001 

0003 

> lt| 


1+2 vs 3 
1 vs 2 


4.57800201 
3.29307589 


0.48466275 

0.64448269 


9.45 

5.11 


<.0001 
0.0003 
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Table 8.7 Analysis of Covariance: Fitting Separate Regressions and Testing for 
Equality of Slopes 


a) Input statements: 

data ancova; 
input trt x y @ @; 
datalines; 

14.1 12.5 1 2.9 10.3 1 1.5 9.6 1 4.3 12.6 1 2.2 11.3 

2 6.8 11.5 2 2.7 8.6 2 3.8 7.2 2 6.4 11.6 2 5.6 8.9 

3 6.6 6.8 3 2.2 4.8 3 3.5 5.6 3 5.5 7.5 3 4.6 6.2 

run; 

proc glm data=ancova; 
class trt; 

model y=trt x(trt)/solution; 

title 1 ’CRD WITH SUPPLEMENTARY INFORMATION ’； 
title2 ’FITTING SEPARATE REGRESSION LINES ’； 

run; 

proc glm data=ancova; 
class trt; 

model y=trt x x*trt/solution; 

title2 TESTING FOR EQUALITY OF SLOPES ’； 

run; 

b.) Output: 


CRD WITH SUPPLEMENTARY INFORMATION 
FITTING SEPARATE REGRESSION LINES 

The GLM Procedure 

Dependent. Variable : y 

Sum of 


Source 


BF 

Squares 

Mean Square 

F 

Value 

Pr > F 

Model 


5 

85.71133848 

17.14226770 


20.88 

0.0001 

Error 


9 

7.38866152 

0.82096239 




Corrected 

Total 

14 

93.10000000 






R-Square 

Coef f 

Var Root 

MSE y 

Mean 



0.920637 

10.06744 0.906070 9.000000 


Source 


DF 

Type I SS 

Mean Square 

F 

Value 

Pr > F 

trt 


2 

66.86800000 

33.434000C0 


40.73 

<.0001 

x (trt) 


3 

18.84333848 

6.28111283 


7.65 

0.0076 

Source 


DF 

Type III SS 

Mean Square 

F 

Value 

Pr > F 

trt 


2 

6.13915023 

3.06957511 


3.74 

0,0653 

x (trt) 


3 

18.84333848 

6.28111283 


7.65 

0.0076 



Table 8.7 {Continued) 


0.0658 
0.0013 
0.5548 


2 6.13915023 3.06957511 

1 17.20712309 17.20712309 

2 1.03336870 0.51668435 


Parameter 

Intercept 
trt 
二 rt 
trt 
x (trt) 
x(trt) 
x(trt) 


Estimate 

3.739494363 B 
4.592919430 3 
1.288276172 B 
O.OOOOOOOCO E 
0.375862069 
0.895697523 
0.544755723 


Standard 

Error 

1.25360461 
1.73482684 
1.85702C75 

0.37622499 

0.25864492 

0.2648014C 


t Value 

2.98 

2.65 

0.69 

2.59 

3.46 

2.06 


Pr > 111 

0.0154 

0.0266 

0.5054 

0.0290 

0.0071 

0.C698 


NOTE : The X matrix has been found to be singular, and a generalized 
inverse was used to solve ~he normal equations. Terms whose 
estimates are followed by the letter r B f are not uniquely estimable. 


CRD WITH SUPPLEMENTARY INFORMATION 
TESTING FOR EQUALITY OF SLOPES 


Dependent Variable : y 


Source 

Model 

Error 

Corrected Total 


DF Squares Mean Square F Value Pr > F 

5 85.71133848 17.14226770 20.88 0.0001 

9 7.38866152 0.82096239 

14 93.1000000C 


R-Square 
0.920637 


Coeff Var 
10.06744 


Root MSE 
C.906070 


y Mean 
9.000000 


Source 

trt 

x 

x*r.rt 

Source 


DF 

2 

2 


Type I SS 

Mean Square 

Y 

Value 

Pr > F 

66.8680C000 

33.43400000 


40.73 

<.0001 

17.80996978 

17.80996978 


21.69 

0.0012 

1.03336870 

0.51668435 


0.63 

0.5548 

Type III SS 

Mean Square 

F 

Value 

Pr > F 


Parameter 


Estimate 


Intercept 


3.739494363 

B 

trt 

1 

4.592919430 

B 

trt 

2 

1.288276172 

B 

trt 

3 

0.000000000 

B 

X 


0.544755723 

B 

x*trt 

1 

0.431106346 

B 

x*trt 

2 

0.350941800 

B 

x^trt 

3 

O.COOOOCOOC 

B 


NOTE : The X f X matrix has been found 
inverse was used to solve the 
estimates are followed by the 
estimable. 


Standard 

Error 

t Value 

Pr > |:| 

1.2536C461 

2. 

98 

0.0154 

1.73482684 

2 . 

65 

C. 0266 

1.85702075 

0 . 

69 

0.5054 

0.26480140 

2 . 

C6 

0.0698 

0.46007067 

0 . 

94 

0.3732 

0.370158C4 

0 . 

95 

0.3 67 8 


to be singular, and a generalized 
normal equations. Terms whose 
letter f B' are not uniquely 


4 6 3 
7 9 6 
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8.9 EXERCISES 

8.1 Consider the following data (y, x) from a CRD, where y represents the response 
after treatment, and a: is a covariate. 

(7.5, 3.5) (6.2, 2.6) (6.8, 3.1) 

(10.0, 3.0) (12.1 ， 3.7) (11.3, 4.1) 

(15.2, 5.1) (10.7, 2.6) (12.9, 3.1) 

(5.1 ， 4.2) (4.6, 3.7) (7.1 ， 4.9) 

(12.1 ， 2.3) (14.2, 2.9) (15.0, 3.5) 

(10.0, 3.2) (9.8, 3.0) (9.6, 2.5) 

(i) Analyze the data, ignoring the covariate, that is, 

(a) obtain treatment means, 

(b) obtain ANOVA table and perform F-test, 

(c) perform Tukey’s Test (a = .05), 

(d) interpret the results. 

(ii) Using the experiment as a pilot study and, again ignoring the covariate, 
determine the number of replications per treatment needed to detect a dif¬ 
ference between the best and the poorest treatment of 3 units or more with 
probability .8, using a test of size a = .05. 

(iii) Do the same as in (i) using the covariate. 

(iv) Do the same as in (ii) using the covariate. 

(v) Comment on the results from (i), (ii) vs. (iii), (iv). 

8.2 An experiment was conducted to compare six different management techniques 
(such as pruning, spraying and fertilizing) for apple trees with respect to yield. 
Each apple tree represents an experimental unit and the trial was layed out as 
a completely randomized design with 5 replications for each management tech¬ 
nique. All the trees underwent the same management practice before the trial. 
For each tree the yield in bushels (x) for the four-year period preceding the trial 
is available. At the end of the four-year experimental period, the yield in pounds 
of apples (y) is obtained for every tree. 

Suppose the partial SAS Proc GLM printout is as follows: 


Treatment 


Source Type I SS Type III SS 
Treatments 40 60 

Prev. Yield 26 26 

Error 46 


Total 112 


Based on this information indicate how you would answer the following ques¬ 
tions: 
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(i) Plot the data. 

Using the methods and formulae described in Section 8.5.2, 

(ii) Estimate the regression line for each treatment. 

(iii) Test the hypotheses that the three slopes in (ii) are equal. 

(iv) Obtain the pooled estimate of the slope. 

(v) Obtain the unadjusted and the adjusted treatment means and compare them. 

(vi) Obtain the ANOVA table. 

(vii) Interpret the results obtained from the ANOVA table. 

Compare the results with those obtained in Examples 8.1 - 8.3. 


(l) Are there differences among the management techniques (treatments)? 

(ii) Has the use of a covariate been successful in reducing the variance for 
treatment comparisons? 

(iii) Suppose the supplementary information were not available. How would 
you test Ho: ti 二 ^ = ... = tq? 

(iv) Suppose xi, = 10, x 2 . = 12, x 3 . = 9, x A , = 10, x h , = 13, x 6 ,= 
8, T,(xij — Xi) 2 = 20. What is the standard error of the comparison 
“treatment 1 vs. remaining treatments?” 

8.3 Suppose an engineer is interested in comparing three chemical processes for 
manufacturing a certain compound. She suspects that the impurity of the raw 
material used in the processes will affect the final product. She therefore wants 
to adjust for that in the final analysis. 

Using a CRD with 15 experimental units she records the following: 

Amount of 

Treatment Impurity Yield 


.5 .3 6.6 .3 . 562 . 6988652 
12109.12 1111 8 . 7 . 11 8 . 6 . 4 . 5 . 7 . 6 . 


. 1.9.51.2.8.7.8.4ivq.2.6 
42.L4.26.2.3.6.5.6.2.3.5.4. 


2 3 
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8.4 Show that 

、 2 1 T xx 9 

av. var (六 一 7v) = - 1 + — 

广 L t — 丄 -t^XX _ 

[see (8.39)]. 

8.5 Show explicitly that 3 as given in (8.50) is the weighted average of the esti¬ 
mates of the individual 戌 where the weights are the reciprocals of var ( 戌 )(i = 
1，2，...彳). 




CHAPTER 9 


Randomized Block Designs 


9.1 INTRODUCTION 

As we have mentioned on several occasions，one of the major objectives of consid¬ 
ering designed experimentation is to reduce error in order to improve the sensitivity 
or precision of the investigation. Hence comes our use of the word error-reduction or 
error-control design (see Chapter 2) for an experimental plan which produces an error 
variance smaller than that for a comparable CRD for purposes of treatment compar¬ 
isons. In Chapter 8 we have already considered one method of reducing the error. This 
was not achieved through a more complex design but rather by making effective use of 
additional information. 

In this chapter we shall consider the situation referred to in Section 8.1 where the 
variability among the experimental units available for a study is systematic rather than 
“random.” Such variation may arise naturally or may be “induced” or introduced by 
the experimenter. Both situations are treated identically from the design point of view 
but may have to be treated differently from an analysis point of view. We shall discuss 
this later but shall give first some examples of both situations. 

In a field experiment there may be a fertility gradient (due to sloping land, or ex¬ 
ample) such that EUs on the same gradient level are more alike than those at different 
levels; or there may be a creek running through the field such that plots equidistant 
from the creek are more alike than those at different distances from the creek (Pearce, 
1983). In a clinical trial, to achieve adequate numbers of replications, several centers 
may be involved and patients (EUs) in the same center may be more alike than patients 
from different centers, not so much because of their own personal characteristics, but 
because of different treatment practices or management styles in different centers. 

Induced variability is often considered when one wants to broaden the scope of the 
validity of experimental findings. An investigator in an industrial experiment may de¬ 
cide to obtain experimental material from different suppliers who use different produc¬ 
tion processes. In a livestock feeding trial it may be important to include animals from 
different breeds; or in an experiment to test different brands of tires one may want to 
include cars from different manufacturers and different models for each manufacturer. 
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It should be clear from these examples that there are many situations with system- 
atic variation among sets of EUs. In such situations it is obviously inappropriate to 
use a CRD and the aim of designing the experiment must be to take this variation into 
account and to “eliminate” the effect such variability would have on the precision of 
treatment comparisons. This leads us to the concept of local control or blocking alluded 
to in Chapter 2. This concept, introduced by R. A. Fisher (1926,1935), is indeed one of 
the most important concepts in the subject of experimental design and all error-control 
designs discussed in this chapter and the following chapters make use of it in one form 
or another. 

To conclude this section we shall relate our discussion above to our general de¬ 
velopment in Section 2.2.4, where we divided the set of blocking factors in intrinsic 
factors, denoted by Z, and non-specific factors, denoted by U. An example of an in¬ 
trinsic factor is given by the inclusion of different breeds in a feeding trial, whereas 
an example of a non-specific factor is the recognition of a fertility gradient due to 
sloping land in an agronomic trial. From the design point of view this distinction gen¬ 
erally is not important, but it may be important with regard to dealing with possible 
blockx treatment interactions (see Section 9.6) and considerations of practical infer¬ 
ence from an experiment involving blocking factors. 

9.2 RANDOMIZED COMPLETE BLOCK 
DESIGN 

9.2.1 Definition 

The simplest and perhaps most widely used block design is the randomized complete 
block design (RCBD) which we define as follows: The experimental material is divided 
into b sets of t EUs each, where t is the number of treatments, such that the EUs within a 
set are as homogeneous as possible and that differences among the EUs are accounted 
for as much as possible by differences between the sets. The sets are called blocks. 
Within each block the t treatments are randomly assigned to the EUs, each treatment 
occurring exactly once in a block. Independent randomizations are used in the b blocks. 

The physical act of randomization can be carried out for each block as described in 
Section 6.2.1 or by using SAS PROC PLAN as described in Table 9.1 for i = 6 and 
b = 3. 

As alluded to above the division of the EUs into blocks is based on a priori in¬ 
formation or what we have called the subject matter model (see Section 2.2), that is, 
identification of factors that may have an effect on the outcome of the experiment. It 
is important to identify these factors since otherwise, if only by chance, the treatments 
may be confounded, that is, not separable, from the “levels” of such extraneous or 
nuisance factors (for an example see Kempthorne, 1952). 

9.2.2 Derived Linear Model 

We shall now consider the analysis of data from an RCBD following the method of 
Kempthorne (1952, 1955) and using an approach similar to that in Chapter 6. This 
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Table 9.1 Randomization for Randomized Complete Block Design 


a) Input statements: 

proc plan seed=23467; 

factors block=3 ordered trt=6; 

title 1 ’RANDOMIZATION FOR RANDOMIZED COMPLETE BLOCK DESIGN，; 
title2 ， t=6, b=3 ，； 

run; 

b) Output: 


RANDOMIZATION FOR RANDOMIZED COMPLETE BLOCK DESIGN 

t=6, b=3 


Factor 


The PLAN 

Procedure 


Select 

Levels 

Order 


block 

trt 


3 3 Ordered 

6 6 Random 




block 


trt - 
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means we shall first derive a linear model and then describe the analysis of variance 
associated with that model. We consider first the case where EU and OU are identical. 
Let Tijk denote the true or conceptual yield of treatment k applied to the jth EU in the 
ith block (i = 1,2, j = 1,2, • •. ，亡 ; fc = 1. 2,... ,t). We now assume, as we did 
in Section 6.3, treatment-unit additivity in the strict sense. Hence we write 

Tijk = Uij + T’k, (9.1) 

where Uij is the contribution from EU j in block i and Tk is the contribution from 
treatment k. Using the fact that we have formed blocks and that randomization is 
performed within blocks, that is, we have restricted randomization, we rewrite (9.1) as 

Ajk = Bi + Uij Tj^ (9.2) 

where Bi = = \J i% is the average unit contribution in block i and hence 

referred to as the block contribution, and = Uij — Ui. with T>jUij = 0 for every i. 
We rewrite (9.2) further as 

T^k = B• (Bi — B•) + Uij + f. {Tk — T.) 

= (B. -f- T.) + (Bi — B.) + (Tk — T.) 4- Uij 
=// + 爲 + 77c + ^ij ; (9.3) 

where the terms are defined in an obvious way. The physical interpretation of the 
quantities in (9.3) is as follows: 

fi is the (conceptual) overall mean yield which would be obtained if each treatment 
were applied to every unit in every block, that is, /i = f.. 

Pi is the difference between the (conceptual) mean yield of all treatments on all 
units in block i and /x, that is, (3i = fi.. — T.. 

Tk is the difference between the (conceptual) mean yield of treatment k applied to 
all units in all blocks and /i, that is, 丁 k = 宁，上 —T and 

is the difference between the (conceptual) mean of the yields of all treatments 
on the jh unit of block i and the mean yield over the whole block, that is, 
Uij = Tij, - fi... It measures the extent to which unit j deviates from the 
other units in block i. We shall refer to this quantity (as we did to a similar 
quantity in Section 6.3) as the unit error (the same results following a slightly 
different argument were given by Wilk, 1955). It follows, of course, from the 
definitions that 

a = o, = o. 

i k 

To characterize the randomization process we introduce the design random variable 
5 匕 =1 if treatment k is assigned to the jth unit in block i, and 5^ = 0 otherwise. It 
follows then immediately that 

p ( 5 ij = i ) = j 
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for any i, j, k because each treatment is applied to only one unit in each block. The 
distributional properties of the can be derived easily following the same arguments 
given in Section 6.2. 

Now let yik denote the observed yield of treatment k in block i. We can then write, 
linking the conceptual yield to the observed yield via the process of randomization, 

yik = (9.4) 

3 

that is, the yik{i = 1,2..... 6; fc = 1,2.... ,t) are a realization of bt observations from 
the population of bt 2 conceptual observations Tijk. Using (9.3) in (9.4) we obtain 

Vik = Pi + 

j 

= fi ~r Tfc ^ik (9.5) 

as a derived linear model for the observations from an RCBD. The only random vari¬ 
able on the right-hand side of (9.5) is Its distributional properties can be estab¬ 
lished easily with the help of the distributional properties of the 5^. For example, we 
obtain 

ER{(jJik) = Er ( 的 j) u .ij 

j 

1 ^ 

= 飞 = Q 

j 

and 

varR(u ik ) = E R {uj ik ) 2 - [E R (u ； ik)} 2 

r -12 

• j J j 

+禮 4-1) ‘ _ 

where we define 

a iu = 5^4 ， （ 9* 7 ) 

j 

that is, g\ u measures the variability of the EUs in block i. Also, for k ^ k f , 

COV 尺 (U /’ 认， ^ik' ) ~ —飞 a"fu ， (9.8) 

that is, observations in the same block are correlated, and for i ^ i' 

COV fi j ， yJ ^i'k) = 0- (9.9) 

that is, observations in different blocks are uncorrelated. 
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using (9.7). To estimate (9.13) it then remains to estimate This is achieved 

through the analysis of variance. 

9.2.4 Analysis of Variance 

The ANOVA table for the RCBD is obtained from the following identity, mimicking 
(9.5 )， ^ ^ 

Vik = y.. + (yi. - y.) + (y.k - y.) + (yik - m. - y.k + y..)- (9.14) 


9.2.3 Estimation of Treatment Contrasts 


It is obvious now that an unbiased estimator for a treatment contrast, E/cC^rfe, with 
EfeCfc = 0, is given by the same contrast in the treatment means, that is, 


Er 


^CkV.k 
L k 


〉: Ck 丁 k 


with 


var/2 


〉: ^kV.k 
.k 


^2 c 2 k va.r R (y. k ) + ^ c k c k ' coy R {y, k ,y. k ')- 

k ky^k r 


Now 


v arfl(u ； ifc) + ^2cov R {uj ik ,u ； i'k) 
i ijH' 


varfl(y. fc ) = ^ 

using (9.6), (9.8), and (9.10), and 

cov R{y.k^y.k f ) = 4 ^2 C0VR ^ ik ! ^ ^ + XI COVR ^ ik 5 Ui，k， ) 

i i^i' 

i l ▽ 2 

i 

using (9.8) and (9.9). Substituting (9.11) and (9.12) into (9.10) we obtain 

^ H c fe S ct * 2 « 


y&r R > ^ c k y. k 


(9.10) 


(9.11) 


(9.12) 


k i 


3 

1i 

( 9 . 
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Transferring y.. to the left hand side of (9.14) and squaring both sides yields 

^2(yik - y..) 2 = tY^{yi. - y.) 2 + b^iy.k - y..) 2 + ^{yik - m. -y.k + v..f- 

i,k i k i.k 

This is the partitioning of the total sum of squares into block, treatment, and error com¬ 
ponents as given in Table 9.2. It is easy to work out the expected values of the various 
sums of squares, where the expectation is taken over all possible randomizations. Since 
yi, — y.. = ft is a constant (using (9.5)) we have 

E R [SS(B)}=SS(B)^tY,0f, 

i 

but y,k — y.. = T/t + HijSfjUij jb is not a constant and hence 

k ij 

Also, SS(Total) is a constant and hence 

S R [SS(Total)] = SS(Total) = +bJ2 T k +J2 u ij - 

i k i,j 

By subtraction we then find 

糾 (SS ⑹] =(1 — 去 ) 

\ ^ ij 

From these results the 五 (MS) under additivity in the strict sense are as given in Ta¬ 
ble 9.2. 

We comment briefly on these results: 

(i) We note the “asymmetry” of blocks and treatments as manifested in the different 
forms for £^[MS(_B)] and £ , j r[MS(T)]. We shall return to this point later (see 
Sections 9.2.6 and 9.3). 

(ii) It follows from (9.7) that 

. 1 y 

that is, the average of the variabilities of the EUs within blocks. 
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(iii) It follows from Table 9.2 that E Uy/b(t — 1) is estimated by MS(E) so that 

vSxr (9.15) 

9.2.5 Randomization Test and F-Test 

For testing of the null hypothesis that there are no differences among treatment effects 
we turn as in Sections 6.5 and 6.6 to the randomization test and its approximation by 
the F-test. Let us define 

^ = S u l 

and consider the test statistic 

Z = SS(T)/[SS(T) + SS(E)} (9.16) 

which under Bo : 丁 1 = 丁 2 = … = 丁 t = 0 is equal to 

Z = SS(T)/U. (9.17) 

We want to compare the distribution, or more precisely the first and second moment, 
of Z in (9.17) under randomization theory and normal theory. Since is a constant we 
have to find ^h[SS(T)] and var^[SS(T)]. From Table 9.2 we know that, under H 。， 

E r [SS(T)] = U/b. 

It can be shown, after tedious and lengthy algebra (Kempthorne, 1952), that 



v&TR [SS(T)} = j i - T ^{U 2 -K), 


where 

/ \ 2 



u l ] ‘ 

i \ 3 / i 


If we assume that g\ u 

= g\ u = = a\ u , then — U/b for every i 

and hence 


K = U 2 /b. 


Then 

WSS (: T ) 卜 |^C/ 2 


and hence 

Er(Z) = ^ 

(9.18) 

and 

…、 2(6-1) 
va r R (Z)= {t _ i)b3 . 

(9.19) 
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If the oJik^ in (9.5) were normally and independently distributed with mean zero and 
constant variance, then MS(T)fMS(E) would follow an F-distribution with t — 1 and 
(b — l)(t — 1) d.f. or, equivalently, Z as given in (9.16) follows the beta (a, (3) distri¬ 
bution with a = t — l^d = (b — l)(t — 1). It then follows (see Section 6.6) that 

E(Z) = i (9.20) 

and 

var(Z) = 2(6- 1 )( 卜 l) 2 

( [6 ( 卜 1 謂一 1)+2] 

or if 2 is small compared to b(t — 1), 


var(Z) ^ 


2 (^- 1 ) 

6 3 ( 卜 1). 


(9.21) 


It then follows from comparing (9.18) with (9.20) and (9.19) with (9.21) that the means 
of Z are the same under randomization theory and normal theory and that the variances 
are approximately equal. We conclude from this that the F-test 


MS(r) 

MS{E) 


(9.22) 


is a reasonable approximation to the randomization test to test Hq: t [ 二丁 2 = … = 

= 0, a result first obtained by Welch (1937) and Pitman (1937) following Fisher 
(1935). Just as in Chapter 6 this result can be further substantiated by computational 
methods, either by enumerating all possible randomizations or, if that proves to be 
prohibitive, through simulation (Monte Carlo) studies. 

Individual contrasts among treatment effects are tested by using a t-test (or equiv¬ 
alent tests for multiple comparisons; see Chapter 7) in connection with result (9.15). 
There is no theoretical justification for this approximation but empirical results, such 
as simulation studies, seem to indicate that such a procedure yields satisfactory results 
(see Kempthorne and Doerfler, 1969). 


9.2.6 Additivity in the Broad Sense 

Up to this point we have considered the case where the assumption of additivity in 
the strict sense holds. It is, of course, desirable and indeed necessary to broaden our 
assumptions and extend model (9.5) so as to include in addition to unit error, other 
errors such as treatment error and observational (sampling) error. This can be done by 
using the same arguments as given in Section 6.3. We shall not give details here except 
to state the model under the assumption of additivity in the broad sense as 

M + di + 77c + ^ik + v ik + Vik ： (9.23) 

where Uik is the treatment error and rfik is the observational error with means zero and 
variances and respectively. Using this model in the ANOVA table (Table 9.2) 
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the E(MS) can be obtained easily and are as given in the right-hand column under 
E(MS). If we define ‘ ^ 

Y^u^/bit - l) = al 

we recognize the similarity of the 五 (MS) under models (9.5) and (9.23) in that for 
E[MS(T)] and £'[MS(E)], has been replaced by say，whereas 

for E[MS(B)] only + has been added. Thus, under model (9.23) the u asym- 
metry” between blocks and treatments discussed earlier is still preserved. This implies 
three things: 

(i) Under Hq: = 丁 2 = … = 丁 t, MS(T) and MS(E) have the same expected 

value, that is, the design possesses the property of unbiasedness; 

(ii) assuming equality of the unit error variances the statistic (9.22) can still be used 
to test Hq \ Ti = 丁 2 二 … = 丁仏 

(iii) there does not exist a valid test for testing equality of block effects (we shall 
return to this point in Section 9.3). 

We shall elaborate briefly on the result (iii) above since we consider this to be an impor¬ 
tant and often not understood finding. It formalizes what should be intuitively obvious, 
namely that a distinction needs to be made between interventional and observational 
studies in general，and the RCBD and the two-factor observational study, specifically. 
With regard to the latter, in both situations the observations are expressed in terms of 
a two-way classificatory linear model (see Section 4.3.2). For the observational study 
the two factors in this model are equivalent (symmetric), whereas for the experimental 
study they are asymmetric: the treatments (levels of factor 4) are randomly assigned 
to the EUs，but the blocks (levels of factor B) are not randomly assigned. That this 
should lead to different properties - related to statistical inference - for the treatment 
and block effects of model (9.23) becomes explicit only through careful consideration 
of the various error components as exhibited in (9.23) and subsequent application of 
randomization theory. This is in sharp contrast to the usual - and incorrect, we might 
add - discussion of this important and far reaching topic. 

To conclude and relate this discussion to that in Section 6.3 we note that Sik = 
ujik + Vik in (9.23) is referred to as the experimental error and that for all practical 
purposes, that is, for purposes of inferences about treatment contrasts, the can be 
regarded as i.i.d. random variables with mean zero and variance a^. We may write 
further + rjik = and hence model (9.23) as 

Vik — H- T/c ~h Cif ,；, 

where the can be considered also as i.i.d. random variables with mean zero and 
variance 

of = 4 + CT《. 

We then have for a contrast of treatment means, Eckp.k ， 

var (E Cfcf/.fc) (9-24) 

k 

and tests can be made in the familiar way. 
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9.2.7 Subsampling in an RCBD 

Just as in a CRD (see Section 6.9) we can encounter in an RCBD (as, in fact, in any 
error-control design) the situation that EUs and OUs are not identical. For an illustra¬ 
tion consider Example 2: Experimental Situation IV in Table 2.6. We refer to this as 
an RCBD with subsampling. The important point here is that now model (9.23) can be 
written as 

qj. Vikl = + 0i 十 Tfc + + Z/jfc + TJild 

Vikl = + €ik + 7]ikl 

with l = 1,2, ..., n and n indicating the number of OUs for each EU. As a con¬ 
sequence we are able to estimate the experimental error variance component of 二 
g\ + al and observational error variance component al separately. This follows in an 
obvious way from the expected mean squares in Table 9.3, namely, 

al = US(OE) 

d\ = [MS{EE) - MS(OE)]/n . 

And, most importantly, the hypothesis Ho : ri = T 2 = . ^ = r t — 0 can be tested by 
approximating the randomization test by the F-test 

尸二 MS(T) 

— MS{EE) 

with t — 1 and (b—l)(t — l) d.f. Also, since for a contrast of treatment means, ^2 c kV.k.-> 
with L Cfc = 0 we find 

kV.k) = ^^cK^+n^ 2 ) 

and + na\ is estimated by MS(EE) it should be clear that MS(EE) plays now the 
important role in any inference concerning the treatment effects. 

9.3 RELATIVE EFFICIENCY OF THE 
RANDOMIZED COMPLETE BLOCK 
DESIGN 

9.3.1 Question of Effectiveness of Blocking 

In many practical situations it is quite obvious that there are substantial differences 
between the blocks, and hence between the block effects, that is, the Bi's in terms of 
model (9.2) or the /Vs in terms of model (9.3). In such cases there is no doubt that the 
naturally arising or created blocks should be utilized for purposes of reducing experi¬ 
mental error, leading to a “small” a\. There are, however, situations where matters 
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are not as clear or only incomplete information about the blocks is available at the 
outset of the experiment. The investigator, relying on a subject matter model (see 
Chapter 2), may have used an ineffective blocking factor, that is, a blocking factor that 
leads to only a small reduction in error compared to a CRD. This reduction in error may, 
however，be offset by a loss of d.f. for SS(E) for the same number of observations, that 
is, t(r — 1) for the CRD versus (t — l)(r — 1) for the RCBD with b — r blocks. This 
then may result in a loss of power or sensitivity with respect to treatment comparisons. 

Even though, once an experiment has been conducted in a RCBD, one cannot ig¬ 
nore the blocking in the analysis one may ask the question: How much have we gained 
by using a RCBD rather than a CRD with the same number of experimental units? To 
answer this question may be useful if one were to conduct a similar experiment using 
the same or similar EUs in the future. 

To study this question Yates (1935) introduced the notion of relative efficiency (RE) 
in the context of estimation of treatment comparisons. For two designs, and D 2 
say, the RE of D\ to D 2 is defined as 


RE(Di to D 2 ) = 


Efficiency D\ 
Efficiency D 2 


var £> 2 

var £)l 


(9.25) 


where varrefers to var(Ec ； c f/ c ) for design D\{1 = 1, 2). In our case D\ is a RCBD 
with t treatments and r blocks and is a CRD with r replications for each of the 
t treatments. The RE as defined in (9.25) depends on the true variances for the two 
designs which, of course, are unknown. The best we can do then is to obtain the 
estimated RE, which we shall denote by ERE. Moreover, we have available only the 
data (observations) from the RCBD. 


9.3.2 Use of Uniformity Trials 


Following Yates (1935) we consider a uniformity trial, that is, a trial with dummy 
treatments, with b blocks and t EUs in each block. Denote the observations from such 
a trial by yij (i = 1,2,, 6; j = 1,2,..., t). The ANOVA table for data with such a 
structure is as given in Table 9.4. 

If the blocks were not used the estimated error variance would be 
SS ( 丑 ) + SS(R) _{b- l)MS(B) + b{t - 1)MS ⑻ 
bt — 1 = bt-1 * 


The estimated error variance with blocks is, of course, MS(i?), so that 


ERE(RCBD to CRD)= 


(b - l)MS(B) + b(t - l)MS(i?) 
(^- l)MS(^) 


(9.26) 


Since we have carried out an experiment with real treatments and not dummy treat¬ 
ments we do not know MS(i?). Instead we only know MS(E) from Table 9.2. Hence 
substituting MS(E) for MS(i?) in (9.26) yields 


ERE(RCBD to CRD)= 


(6 - l)MS(B) + b(t - l)MS(^) 
{bt - l)MS(E) 


(9.27) 
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Table 9.4 ANOVA for Uniformity Trial 


Source 

d.f. 

SS 

MS 

Blocks 

6-1 

SS{B) 

MS ⑻ 

Within Blocks (Error) 

b(t - 1) 

SS(i?) 

MS(i?) 

Total 

bt — l 

SS(B) + SS(i?) 


where MS(B) is also taken from Table 9.2. It is useful to mention that (9.27) can also 
be obtained by randomization arguments only, that is, by comparing restricted (RCBD) 
versus unrestricted (CRD) randomization (Kempthorne, 1955). 

To conclude this discussion we should mention that sometimes it may be quite 
feasible and appropriate to conduct a uniformity trial before actually using an RCBD. 
In that case we then know MS(i?) from Table 9.4 and hence can use (9.26) to obtain 
the ERE. 

9.3.3 Interpretation and Use of Relative Efficiency 

In general MS(B) will be larger than MS(E) and hence ERE will be larger than one or, 
as it is usually presented, larger than 100%. Since the ERE is obtained when both the 
RCBD and CRD have the same number of replications, namely, fe, expression (9.27) 
can then be rewritten as 


or 


ERE = 


^SrcRp/b 

^RCBD/b 


^crd _ var^cRD 
6 ERE = b 


The practical interpretation of ERE thus is that we require 

r = b x ERE 


(9.28) 


replications per treatment for a CRD to be as effective as the RCBD with b replications, 
that is, b blocks, using the same experimental material. We emphasize again that the 
ERE speaks only to the question of estimation, that is, precision of estimates, and not 
to the question of power, that is, sensitivity of the experiment. For this reason it may be 
advisable to consider only a RCBD with an ERE larger than, say, 125% to be “better” 
than the comparable CRD. Another interpretation of the ERE is given by Yates (1935), 
namely that (1 — 1/ERE) 100% is the percent variation among the EUs removed by 
blocking. 

We commented earlier (Section 9.2) that there does not exist a legitimate test for 
Hq ： 0i = P 2 = ••- = Pby at least not in the context of the ANOVA table, that is, 
H = MS(B)/MS(E) is not an appropriate test statistic. There exists, however, a 
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monotonic relationship between H and ERE which at least gives some meaning to H 
(Lentner, Arnold, and Hinkelmann, 1989). It follows from (9.27) that 

ERE = a- + (1 — a) H, 

where a = b(t — l)/(bt — 1). Hence ERE > 1 if and only if if > 1. This gives a 
certain usefulness to H ，but referring to our earlier discussion, it tells only part of the 
story. 

Finally, some knowledge of ERE or ERE* = 1-1 /ERE may be useful in determin¬ 
ing the number of blocks b to be used if one has determined r for a comparable CRD 
by using the tables of Bowman and Kastenbaum (1975) as discussed in Section 6.7. 
We have from (9.28) that 


ERE 


=r(l - ERE*). 


For example, if we have some idea that blocking will reduce variability by 25% then 
25% fewer replications than those required for the CRD are necessary. Bowman and 
Kastenbaum (1975) provide a limited set of tables for the number of blocks for the 
RCBD, but the above procedure may be quite satisfactory from a practical point of 
view. 


9.4 SUPPLEMENTARY INFORMATION 
AND ANALYSIS OF COVARIANCE 

9.4.1 The Model 

One method of generating blocks is to make use of supplementary information in the 
form of a covariate. The procedure is to rank the EUs with respect to the covariate 
(which of course must be available before the experiment) in increasing order of mag¬ 
nitude, often referred to as outcome groups, and then use the first t EUs as one block, 
the next t EUs as another block, and so on. Cox (1957) has shown that this method is 
preferable to a CRD with covariate unless the correlation between y and x is at least .6. 
Using the same covariate for purposes of analysis in addition to its use as a blocking 
device will generally not provide much additional information. There may, however, 
be situations where in addition to blocking the use of some covariate will lead to further 
reduction of the error variance, that is, we may consider the model 

Uik = fx + j3i + T k + ~{x, k - x..) + e* k , (9.29) 

where the e* k can be considered as i.i.d. random variables with mean 0 and variance 
(7^*. This is, of course, an obvious extension of the analysis of covariance procedure 
discussed in Chapter 8. We shall now give a brief description of the technique for the 
RCBD without repeating the basic philosophy and assumptions set forth in Chapter 8. 
This discussion also serves as an example of extending this technique to other error- 
reduction designs as well (for a general discussion of the arithmetic of analysis of 
covariance see also Section 4.13). 
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9.4.2 Least Squares Analysis 

Model (9.29) is a special case of model (8.55) 

y = X/Li + Z 7 + e% (9.30) 


where X/i represents the classificatory part 


(3|X ;J |X T ) p 


for the RCBD with 


X, 


0t 


0t 


0t 


and 


X T 


」 fetxt 


Then, using model (8.58) we have in the normal equations (8.60) 


X'X 


’ bt t3 f b b3 f t 
tOb tlb ^b^'t 
b3 t 3 t 3 f b bl t 


with rank (X’X) = 6 + t — 1. A ^-inverse of X^X is obtained by imposing the 
conditions H 戌二 0, = 0. Corresponding to (8.61) we then have 

fHo) = (x ， x)-xv 
y.. 

Vi. - y.. 


Vb. - y.. 
y.i - y.. 


y.t — y.. 

Furthermore, because of the definition of Rx as the matrix for the error sums of 
squares, we obtain 7 from (8.62) as 


八 Exy 

1 = ~e 7 x ' 


(9.31) 
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where E xy and E xx are the error sum of products for x and y and the error sum of 
squares for x for the RCBD ， respectively. Finally, we have corresponding to (8.63)，for 
example 

h = y.k - y.. - 7( 无 ./c 一 无 ..） （ 9.32) 

or 

ft-\-f k = y. k - j(x,k -x.). 

All the other results follow similarly. 


9.4.3 The ANOVA Table 

We shall comment now briefly on the ANOVA table. It follows from (8.66) that the 
error sum of squares can be written as 


SS(I|X,Z) 


Jyy 


(9.33) 


where, again, E yy , E xx , E xy refer to error sums of squares and error sum of products, 
respectively, for the RCBD. Hence 


^ = 


^xx 




(9.34) 


To test the hypothesis : 了 1 = D = •.. = T t = 0, we obtain the SS(Treatments) as 
SS(X r |3, X P ,Z) = SS(I|X*, Z) - SS(I|X,Z), 

where 


X* 


or, if we write fi pi = 叫， 


X* 


0t 


0t 


9bt 


0t 


btx(l+b) 


0t 


0t 


0t 


btxb 


It then follows from the results for the CRD that 


ss(i|x* ， z) = (r TO + E w ).- 


{T X y + E X y ) 2 

+ E xx 


and hence, using (9.35) and (9.33) 


ss(x r |a,x /3 ,z) = r yy - [T ^- + -~： v)2 - + 

XX \ ^xx ^xx 


(9.35) 


(9.36) 
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The test statistic for Ho is then obtained by substituting (9.36) and (9.34) in (8.68) with 
d = t 一 1. 

The reader should recognize that the right-hand sides of (9.36) and (8.28) are of the 
exact same form since for the CRD we have S xy = T xy + E xy and S xx = T xx + E xx 
see Table 8.1. Moreover, (9.31), (9.32), and (9.36) carry over to all other error-control 
designs in the following chapters keeping in mind only that- E xx ，and E xy are 
the appropriate error sums of squares and products, respectively, for the error-control 
design under consideration. 


9.5 MISSING OBSERVATIONS 


Even in well-planned experiments it may happen that, for reasons that cannot be as¬ 
cribed to the effect of the treatments, one or several observations may not be available. 
This destroys the simplicity of the analysis of such data, but unless the missing ob¬ 
servations occur in a particular pattern the experiment is not a complete failure. With 
existing statistical software such data can be handled easily on any computer. Using 
a general linear models program the least squares analysis can be performed and all 
necessary information will be provided. In essence then the design becomes an incom¬ 
plete block design and methods for dealing with such designs are described explicitly 
in Chapter II. 1. 

Historically, this topic has received a great deal of attention, mainly for purely 
computational reasons. Yates (1933) developed a procedure for estimating missing 
observations, substituting the estimates for the missing observations and then analyzing 
the thus completed data set in the usual fashion. This leads to an approximate analysis 
which, however, is quite satisfactory for most purposes. As we mentioned above, there 
is today no particular reason to describe and use this method for the RCBD. Yet we 
shall describe a particular method of estimating missing values here for the following 
reasons: (i) It may not always be possible to perform the least squares analysis from 
first principles for complex and highly structured data sets because the large number 
of parameters leads to normal equations which cannot be solved on existing computers 
(see for example, Perry, 1986); (ii) the method to be described for estimating missing 
observations is generally applicable but easily described and illustrated for the RCBD; 
and (iii) the method is applicable to situations other than experimental designs (see 
Hinkelmann, 1968). 


9.5.1 Estimating a Missing Observation 

The method we shall describe now was originally proposed by M. S. Bartlett and ex¬ 
panded by Coons (1957) and is based on analysis of covariance techniques (see Chap¬ 
ter 8). Consider then a RCBD with t treatments in b blocks and suppose that the 
observation for treatment k* in block is missing. We then write the model for the 
observations from this design as 

Vik = M + 爲 + T/c + "yXifz -f- 6iki (9.37) 



296 


CHAPTER 9. RANDOMIZED BLOCK DESIGNS 


where 

_ f 0 fori = i\ k = k* 

Vlk — \ Vik otherwise 

— ( — 1 for i = ? k = k* 

Xlk ~ ^ o otherwise 

It follows then from (9.37) immediately that 7 is an estimate of y^k* and from (9.31) 
we have 

(9.38) 

Now, using the special nature of and above, 

Exy = 〉二 ( 尤 ifc _ _ T./c + ^-^{yik — Vi. _ y.fc + V..) 

ik 

= x ikVik -b^x.kV.k + bd 


E X x = - b^x% + btx 2 . 

ik i k 

Substituting the values for and yik as defined above we obtain 
E X y = 0 + . + y^k* — y.. 

= \ Bi ' + \ Tk '~lt G ' 


Bi* = total of all observations in block i* 

Tk* = total of all observations for treatment k* 
G = grand total 


1 1 1 1 

(b-m-D 


Substituting (9.39) and (9.40) into (9.38) yields 


bBi* + tT^* — G 

{b-m-i ). 


We can then substitute 7 for the missing observation and proceed with the analysis of 
the RCBD as outlined in Section 9.2. 
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9.5.2 Using the Estimated Missing Observation 

As a consequence of the procedure advocated in Section 9.5.1 we have the following: 

(i) The comparison between treatment k* and any other treatment k is given by 

子 k* — T+k — ^ {Tk* + 7 ) — S.k. 

(ii) The same result is obtained by using the analysis of covariance model (9.37) and 
the procedure described in Chapter 8, namely, 

千 k* — 乎 k = y.k* — y.k — ~ 元 ,k) 


with 

x.k- = -p x,k =0(k ^ k*). 

(iii) It follows easily from the results of Chapter 8 that 


var(ffc* - r k ) 


，2 1 bt 

、厂庐 (6-1)( 卜 1), 

，2 t \ 

b(b-i)(t-i)) 


(iv) For k, k f ^ k* 


子 k _ 丁 k’ = y.k — y.k' 

var(f,-^) = ^- 


(v) 

, E°- 

SS(E) = Eyy - •=—； 

■^xx 

where E yy is obtained with yi*k- = 0 and E xy and E xx are as given in (9.39) 
and (9.40), and 

MS{E) = SS(£)/[(6 — 1)(* - 1) - 1)] = a 2 e ., (9.41) 

that is, the d.f. are reduced by one for the one missing observation. We note here 
that MS ㈤ in (9.41) is the same as would be obtained from the least squares 
analysis for the incomplete data set (see also Chapter II. 1). 

(yi) The SS(T) with 7 substituted for yi 中 is positively biased (see Exercise 9.10) 
and hence the usual F-test for testing Hq ： ri = T 2 = ■- = r t will only be 
approximate. Only in borderline cases of significance, however, will one need 
to obtain the exact SS(T) from the least squares analysis (see Chapter II. 1) or 
correct for the bias (as found in Exercise 9.10). 
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9.5.3 Several Missing Observations 

If two observations or more are missing we can extend model (9.37) and include one 
covariate for each missing observation. The methods of Section 8.7 can then be em¬ 
ployed using the same ideas as outlined above. Explicit missing value formulas are 
given by Glenn and Kramer (1958). 

We shall conclude this section with a brief discussion of the general case, that is, 
the case of m missing values and error-control designs more general than the RCBD 
(for example, the Latin square design of Chapter 10). To facilitate the discussion we 
use notation of Chapter 4 (in particular Section 4.13). 

Let 

y = X/3 -f e (9.42) 

represent the model for a given error-control design where f3 represents all the parame¬ 
ters associated with that design. For the RCBD, for example, f3 represents the constant 
"，the block effects /3i, /? 2 ,.. • ， A?，and the treatment effects ri,r 2 ,... ,r t . Let us now 
write (9.42) as 

y=(y；) = (xO /3 + e (9 ' 43) 

and let us suppose that yi is being observed and y 2 , representing m observations, is 
missing. Corresponding to (9.37) we then introduce the model 

（9 . 44) 

that is, we replace in (9.43) the vector of the missing observations, y 2 , by 0 and intro¬ 
duce covariates in the form of the matrix 



with I = I m . The NE for model (9.44) are then 

x ， x3 - Ki = xv* = x； yi 

-X 2 0 + 7 = Z'y* = 0. (9.45) 

By the usual covariance argument (see Section 4.13) we obtain from (9.45) 

Z [I- P X ]Z 今二 Z，[I - Px]y*. (9.46) 

We recognize, of course, that the elements of the coefficient matrix on the LHS of 
(9.46) are obtained as the error sums of squares and error sums of products for the 
given error-control design with the m columns of Z used as “observation” vectors. 
Similarly, the RHS of (9.46) is obtained as the error sums of products of the columns 
of Z and the vector y* as defined in (9.44). Solutions to the equations (9.46) for the 
RCBD are the missing value formulas given by Glenn and Kramer (1958) mentioned 
earlier. 
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Now let 

I-Px=d ， 

where U, V, W are of order (n — m) x (n — m), (n — m) x m, and mxm, respectively, 
where n is the size of y*. Since I — Px is idempotent we have 

/U V\ /U V\ _ /U V\ 

[y w 八 V wj _ yv' w) 

and hence V^V + W 2 = W. We also have 

Z^I-Pxjy^-VVi 

Z'[I- P X ]Z = v，v + w 2 = w. 

The NE (9.46) for 7 then are 

W^y= -VVi 

and hence 

7 = (9.47) 

It follows then from (9.47) that 

var( 7 ) = W - 1 V/VW -1 。 2 

=W _ 1 (W-W 2 )W-V2 
=(W- 1 -I)a 2 e . 

Returning now to the NE for (3 we have from (9.45) and (9.47) 

X'X0 = X； yi + 

= X , 1 y 1 -X , 2 W- 1 V , yi. (9.48) 

Since the model value of the LHS of (9.48) equals the model value of the RHS we have 
X，X = X；Xi - X^W - 1 V% 
and 

= - [X，X - X ； Xi] = X^X 2 . (9.49) 

It follows then from (9.48) and using (9.49) that 

var(X’X 匆 =(X; — X^W^OCXi - VW~ l X 2 )a 2 e 

=[X'X + (9.50) 

In the context of our approach to the analysis of data from designed experiments, (9.50) 
should be used to obtain the variances of estimated treatment contrasts. It shows that 
if a treatment contrast does not involve a treatment with missing observations then the 
variance is the same as would have been obtained from the complete design. For other 
treatment contrasts, (9.50) shows how the variance will have to be adjusted, that is, 
increased. 
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9.6 NONADDITIVITY IN THE RCBD 

9.6.1 The Problem of Nonadditivity 

In our discussion of the RCBD so far we have made the assumption of treatment-unit 
additivity in the strict sense [see (9.1) and (9.3)] or, more realistically, additivity in 
the broad sense [see (9.23)]. In most cases such an assumption is not unrealistic, but 
we may, of course, conceive of situations where it does not hold. Such a situation we 
shall refer to as nonadditivity. An explicit formulation of this situation can be given by 
amending model (9.1) to 

Tijk. = Ujj + + Rijk, (9.51) 

where Rijk refers to the nonadditivity which we also call unit-treatment interaction. 
Such interaction may arise in two ways: 

(i) The effect of the treatment depends on the EU to which it is applied in the sense 
that if we could apply two treatments, say k and k\ to the same EU, then we 
would observe 

Tijk _ Tijk’ 寺 T.ij’k — 

for two EUs j and j" in block i. 

(ii) The effect of the treatment depends on the block in which it is applied in the 
sense that if the EUs in a block were identical 

Tijk — 7 ^ Tj / 一 Tj/j”’k’ 

for any two EUs in blocks i and〆. 

We may refer to the first type of interaction as the strict unit-treatment interaction, and 
to the second as block-treatment interaction. 

It should be clear from the description of the nature of strict unit-treatment inter¬ 
action that there is no way to investigate whether it exists (see Kempthorne, 1952, and 
Wilk, 1955) because in the RCBD we can apply only one treatment to each EU. Even 
the block-treatment interaction, and it surely is the more important of the two, cannot 
always be addressed satisfactorily. We shall distinguish here between two situations: 
(i) there is only one blocking factor, either a non-specific factor (U) or an intrinsic 
factor (Z); (ii) there are several blocking factors involving factors from Z and U. For 
scenario (i) we shall describe two ad hoc procedures (Sections 9.6.3 9.6.5)，and for sce¬ 
nario (ii) we shall outline appropriate analysis of variance procedures (Section 9.6.7). 
An alternate procedure, addressing design questions, will be discussed in Section 9.7. 

9.6.2 General Model for Nonadditivity 

In light of our discussion above, we can rewrite (9.51) as 

T^k = Uij + Tfc 4- Qik -)- Sijk ， (9.52) 



9.6. NONADDITIVITY IN THE RCBD 


301 


where Qik represents the block-treatment interaction and Sijk the strict unit-treatment 
interaction. Following (9.2) and (9.3), model (9.52) can be written as 

Tijk. = Uij Tk Qik ^ijk 

—fl- pi {0T)ik Uij Sijk ， 

the terms of which are defined as functions of the as follows (see Wilk, 1955) 

M = f . 

Pi = f L . - f.. 

Tk = T, k - T.. 

(0 丁 ) ik = (fi.k — fi..) — _ t,.) 

Uij = fij. - fi., 

Sijk = {T'ijk - fij.) - [Ti.k — fi..)- 

The actual observations, can then be expressed again as 

Vik — 的 jTijk 

3 

=/I + + Tfe + (0T)ik + ^ ^ij u ij + ^ 的 j Sijk 

3 3 

=fX pi - (/^T ) 认十 CJ 认 + Qijk ， (9.53) 

where (/3r)ik represents the block-treatment interaction with 

D 卢 丁 )ik ~ ^2(0T)ik = 0 
i k 

and u)ij and Qjk are random variables representing unit error and strict unit-treatment 
interaction, respectively. We shall assume now that all s^k — 0 and hence model (9.53) 
reduces to 

Uik = M H - Pi + A + )i/c + ^ik 

or, if we add [see model (9.23)] treatment and observational error, 

Vik = M + 汰 + Tfe + ^ik • (9.54) 

It is clear from (9.54) that (8r)ik and lead to the same entry in the ANOVA table, 
that is, cannot be separated, with 

E[MS(E)\ = a 2 e + 丁 )l'/(b-l)(t-l). (9.55) 

i.k 


Since there is no mean square with expected value equal to there does not exist a 
test for Hq : (,dr)ik = 0 for all % and k. 
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9.6.3 One Blocking Factor: A Specific Model for 
Nonadditivity 

It is evident from (9.54) and (9.55) that the interaction as expressed by the ( 6 —1)(^ —1) 
independent [ ， & r)ik’s must be modeled more specifically so that it accounts for some 
of the (b - 1)(^ — 1) d.f. with the remaining d.f. attributable to random error. Such a 
procedure was first proposed by Tukey (1949) and more explicitly by Mandel (1961). 
We shall describe briefly Mandel’s procedure and show how it relates to Tukey’s pro¬ 
cedure. 

It follows from (9.54) that 

E(yik - Vi.) = + {0r) ik 

and under additivity, that is, all (y3r)^ = 0 , that 

~ yi.~) 二丁 k ， 

which is independent of i. One way to model the dependence of E{yik — yi.) on i is to 
consider a linear function of /3j and write 


E{yik - yi.) = n + QkA ， 

(9.56) 

that is, assume 


丁、 ik = QkPi> 

(9.57) 

We are then considering the model 


Vik = M + 执 + Tfc + 

(9.58) 


with T.iSi = EfcTfe = SfcQfc = 0. In order to give a concrete interpretation to model 
(9.58), let us write Qk = 7 ^ — 1. Then (9.58) becomes 

Vik — 了 /c + ""IfkPi "I" ^ik 

= "A; + ^ikt (9.59) 

where "/c = fi + r^. Thus the data from a RCBD can be expressed as a set of t 
regression lines where the b observations for treatment k(k — 1 ， 2 ,..., t) are regressed 
on the block effects. It follows from EkQk — 0 that 

7 =|[ 7 /c = l. (9.60) 

k 

If all 7 ^ are equal and because of (9.60) equal to 1, then (9.58) reduces to the additive 
model, hence departure of some 7 ^ from 1 indicates block-treatment interaction. Since 
the the regressor variables, are not known, we replace them by 0i = yi, — y.. and 
hence obtain in the usual way 

〉: Vik Pi 

i 


Ik = 


(9.61) 



9.6. NONADDITIVITY IN THE RCBD 


303 


Using [i = = y.k — y.. we can then write the following identity as suggested by 

(9.58) ^ 

Vik — A + A + + {lk ~ + ^ik (9.62) 

with 

△ifc = (jjik - y.k) _ 


9.6.4 Testing for Nonadditivity 


Using the properties of the various terms in (9.62), it follows easily that 

yfk = + b J2 f k + — u 2 街 + H 

i,k i k k i ik 

provides a partitioning of the total sum of squares and gives rise to the basic ANOVA 
in Table 9.5. Mandel (1961), following arguments given by Scheffe (1959), has shown 
that, under i^o : 71 = 72 = • • • = = 1 and the assumption of normality, SS(Slopes) 

follows a scaled central x 2 -distribution with t — 1 d.f. Also, SS(Error) follows a scaled 
X 2 -distribution with (6 - 2)(t - 1) d.f” the scales being the same for both sums of 
squares. Since both sums of squares are independently distributed it then follows that 


F — MS(Slopes) 
~~ MS(Error) 


(9.63) 


provides a test for Hq : 71 = 飞 2 = … 7 亡 =1 and hence a test for block-treatment 
interaction of the form specified by model (9.58). A derivation of this test based on 
randomization theory is provided by Roux (1984). 


9.6.5 Tukey’s Test for Nonadditivity 

Tukey (1949) implicitly and Ward and Dick (1952) explicitly consider a special case 
of (9.58) by using 

Qk = 0r k (9.64) 

and hence the model 

Uik M H - Pi ^~k "I - "I - ^ik (9.65) 

to detect interaction. This may seem to be a very specialized and narrow model but 
Ward and Dick (1952) show that (9.65) arises from a multiplicative model of the form 

Vik = + Pi + e， ik){^ n + T k + e ik) 

with E/3 t - = = 0. Following earlier arguments we can write (9.64) as 

— 1 = 

that is, the regression coefficients are expressed as a linear function of the treatment 
effects. Writing 


7 fc - 1 = + 5 k 
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Table 9.5 ANOVA for RCBD under Nonadditivity 


Source 

d.f. 

SS 

Blocks 

b-1 


Treatments 

t-1 

bYiiv-k-y：) 2 

Slopes 

t 一 1 

- i) 2 52( 识 . -y .) 2 
k i 

Regression 

1 

i k 

Deviation 

t-2 

Subtraction 

Error 


Subtraction 

Total 

bt — 1 

^{yik-y ..) 2 

ik 


with E/c(5fe = 0, we obtain in the usual way the estimate of the regression coefficient 
6 as 


- l)^fc 

e = ^ — —— 

I > 2 

k 

Y2yik(vi. - y..){y.k - y..) 

— ik 

i k 

It follows then that in Table 9.5 

SS(Slopes)= I>-D 2 EA 2 

k i 

k i k i 

-^ 2 E^ 2 EA 2 + E^E4 2 

k i k i 

can be partitioned into two components, 

SS(Regression) = 9 2 ^ ^ Sf 

k i 


(9.66) 


(9.67) 


(9.68) 
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and 


SS(Deviation) = ^ ^ ,df. 


(9.69) 


where 


K = Ik - Ofk - 1 . 


To test the hypothesis of no interaction, that is, Hq ：0 
test statistic, 


0, Tukey (1949) proposed the 


F 


SS(Regression) 


[SS(Deviation) + SS(Error)]/[(6 - l)(t -1)-1] 


or, using the notation from Table 9.2, 


F 


SS(Regression) 


[SS(E) - SS(Regression)]/[(f> - l)(t - 1) - 1 ] ， 


(9.70) 


which follows an F-distribution with 1 and (b — 1)(^ — 1) — 1 d.f. (see Scheffe, 
1959). Robinson (1975) has shown that this test is a reasonable approximation to the 
corresponding test based on randomization theory. The test given by (9.70) is generally 
referred to as Tukey’s one-degree-of-freedom test for nonadditivity. 

An alternative derivation of (9.66) and (9.70) is given by Scheffe (1959). It is based 
on the model 

Vik = H - 0i H - ^ik 

with x ik = (vl - y.){y.k - y..), that is, replacing dm in (9.65) by 戌 Using the 
analysis of covariance technique (see Section 9.4) we obtain 


Exy 

Exx 


^2(yik - Vi. - y.k + y..)( x ik - St -x. k + $.) 


ik 


> - 工 i. - 工 ./e + ) 


ik 


〉 : Vik^ik 


ik 


E- 

ik 


since = x.. 


0. Hence 9 takes on the form (9.66). 


9.6.6 Generalizations 

A generalization of Tukey’s test was proposed by Mandel (1971) and investigated in 
more detail by Johnson and Graybill (1972). They consider a model of the form 


Vik = ". + 爲 + A + H - 6^ 


( 9 . 71 ) 
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and derive a test for Ho : 沒 = 0 by using the i x 6 matrix of residuals Z = (Zik ) ， where 
Zik = yik - Vi. — y.k + y... They derive the likelihood ratio test statistic 


A* = 


z fk - 


bt/2 




where Ai is the largest eigenvalue of Z’Z. The distribution of A* is then related to 
that of the eigenvalues of a Wishart matrix. For more details the reader is referred to 
Johnson and Graybill (1972), Corsten and van Eijnsbergen ( 1972), and Marasinghe 
(1985). 

We close this section with the following remarks: The Johnson-Graybill procedure 
outlined above and the extensions due to Mandel (1971) using more than just one term 
to model interaction, that is, using more than just the largest eigenvalue of Z ; Z, were 
developed for two-way layouts with one observation per cell. Even though data from 
a RCBD can also be presented as a two-way array, we have pointed out earlier in this 
chapter that there exists a certain asymmetry between blocks and treatments. For this 
reason we prefer to test for nonadditivity in the RCBD by using (9.63) or (9.70). 

As models (9.58)and (9.65) indicate the type of interaction included in the model is 
of a very specific form. This limits, by necessity, our general inquiry into the possible 
existence of block-treatment interaction. Obviously, if the tests presented by (9.63) or 
(9.70) are significant then such interaction is present. If those tests, however, are not 
significant then this indicates only that interaction of the specific type is not present, 
but this does not preclude block-treatment interaction of a different type, except that 
we may be unable to detect it. 


9.6.7 Several Blocking Factors 

We now turn to the case of several blocking factors. Such cases can occur in different 
ways which are characterized by the different relationships of the factors with each 
other, that is, whether they are crossed or nested (see 4.12.2). We shall illustrate this 
first in terms of the following examples before we consider the question of block- 
treatment interaction. 

Example 9.1: An experiment was set up to study genetic parameters for radiata pine 
(Dean, et al, ， 2006). The full-sib families from a half-diallel cross with five parents 
represent the treatments which are evaluated for several growth traits, such as height 
and sectional area of stems, in a field trial using an RCBD at two different sites. More 
specifically, the basic arrangement consists of six blocks per site, each block containing 
ten plots to which the ten full-sib families are assigned at random. On each plot five 
trees from the assigned full-sib family are planted. Both sites are located on the North 
Island of New Zealand, but one site has colder climate than the other, being thus more 
prone to frost and needle blight. □ 
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Example 9.2: The objective of an agricultural study was to determine the effect of 
each G. soja SCN resistance gene on yield and other agronomic traits in elite soybean 
backgrounds (Kabelka et al., 2006). One part of the field trial consists of growing 
plants from 100 derived genetic lines, representing the treatments, in 1.5 x 3.2 m two- 
row plots, representing the experimental units, in an RCBD with 6 = 2 blocks (repli¬ 
cations). This basic arrangement was repeated, using different randomizations in four 
different environments. More specifically, the environments represent combinations of 
two years and different sites. Plots were evaluated for days to maturity, plant height, 
lodging, and seed yield. 口 


Example 9.3: A study is contemplated to examine the effectiveness of three meth¬ 
ods to memorize German vocabulary at the high school level (Kirk, 1982). It seems 
reasonable to take student ability, as measured by IQ, and gender into account. Thus, 
one approach would be to set up, say, five IQ classes, with three students for each class 
and gender, leading to ten blocks. In each block the three methods under investigation 
will then be assigned randomly to the three students. □ 

The above examples have one feature in common: They can be looked upon as 
replicated randomized block experiments ， where one of the blocking factors can be 
considered the “replicating” factor, typically an intrinsic factor (Z). In Example 9.1 the 
basic RCBD is replicated at different sites, indicating different climates; in Example 
9.2 the replicates are the different environments, and in Example 9.3 we have an RCBD 
for each gender. What is different, however, is the relationship of the blocking factors 
to each other: In Examples 9.1 and 9.2 we have a nesting relationship, whereas in 
Example 9.3 we have a crossed relationship. More specifically, the blocks in the field 
are nested within the sites and environments, respectively, because the blocks at one 
site (environment) are different from the blocks at another site (environment) whereas 
the IQ classes are the same for both genders. 

This crucial difference in the blocking factor relationship is reflected in the linear 
models that describe the data from such experiments as we extend model (9.23) and 
exploit those relationships. To show this we begin with model (9.23) 


Vik = " + 汰 + % + (9.72) 

where in this form the 成 represent the “block” effects of the site-block (Example 9.1) 
or environment-block (Example 9.2) or the IQ class-gender (Example 9.3) combina¬ 
tions. It is advantageous, however, to explicitly represent the individual blocking fac¬ 
tors and their relationship to each other, because this will allow us to test certain aspects 
of “block”-treatment interaction which may be important for the analysis and interpre¬ 
tation of the experimental data. 

For the general development of extending model (9.72) we shall denote the two 
blocking factors by A and C, with A having a “levels” and C having c “levels”. Thus, 
the total number of blocks is 6 = ac. We then consider the following two cases: 
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(i) Nested blocking factors: 

For this case the 8i in (9.72) will be expanded as ai + 7 ^, and the new model may 
contain an interaction term representing possible interaction between blocking 
factor A and the treatments. Thus, the model may be written as 

Uijk = OLi "I" + Tfc + {ocT (9.73) 

with z = 1 , 2 , ..a; j = 1 , 2 , ..c; fc = 1 , 2 , .. t\ oli representing the effect 
of the i-th level of A, and 7 ^- representing the effect of the j-th level of factor C 
at the i-th level of factor A. The interaction term (ar)ik represents part of the 
block-treatment interaction, namely factor A-treatment interaction (A x T). 

The analysis of variance associated with model (9.73) is given in Table 9.6. A 
derivation based on randomization theory is given by Stewart (1980) and Hinkel- 
mann and Alcorn (1998). Table (9.6) indicates that 

MS(A x T) 

一 MS(E) 

with (a — l)(t — l) and a(c—l)(t — 1 ) d.f. provides a test for part of the possible 
block-treatment interaction, AxT. 

In Example 9.1 the blocking structure is the same as in Example 9.2, but the ex¬ 
perimental set-up provides an additional feature, namely subsampling (see Sec¬ 
tion 9.2.7) as each tree is an observational unit. An obvious extension of model 
(9.73) is given in (9.74)，with the associated analysis of variance given in Table 
9.7 , 、 " — 

Vijkl = M + lij + Tfc + {otr)ik -f Cijk + Tjijkl (9.74) 

with l = 1 ， 2,… ， n. It is clear from Table 9.7 that now the experimental error 
plays the important role in testing for A x T interaction as the denominator of 
the F-ratio 

MS{AxT) 

— MS{EE) 

with (a — l)(t — 1 ) and a(c — l)(t — 1 ) d.f. 

(ii) Crossed blocking factors: 

Each combination of the levels of the factors A and C represents a block. This 
factorial structure (not to be confused with factorial treatment structure; see Sec¬ 
tion 11.2) leads to an expansion of Bi in (9.72) into ai + 7 ^ + (cry)y and to 
inclusion of certain block-treatment interaction terms as given in model (9.75): 

Uijk = /i• + at + + r fc + {ar) ik + {jr) jk 4 - e ijk (9.75) 

with i = 1,2,..a.; j = 1, 2,..c; fc = 1,2,.. t. The 7 j, (a?)。represent 
block effect components, and ( 7 t)j 7 c represent interactions between the 

blocking factors A, C and the treatments. Model (9.75) leads to an analysis of 
variance as given in Table 9.8. 
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In this situation it follows from Table 9.8 that we can isolate and test for two 
block-treatment interaction components, namely A xT and C x T by using the 
F-ratios 

r MS (A x T) 

- MS{E) 

with (a — l)(t — 1) and (a — l)(c — l)(t — 1) d.f. and 

MS(C x T) 

= MS{E) 

with (c - l)(t - 1) and (a — l)(c — l)(t — 1) d.f., respectively. 

9.6.8 Dealing with Block-Treatment Interaction 

In the preceding sections we have explored methods of investigating, that is, detecting 
possible block-treatment interactions for various types of randomized complete block 
designs. If interactions do, indeed, exist then the question arises: What to do next? 
How can we or, even, can we at all make inferences about the treatment effects? There 
do not seem to exist general answers to these questions. 

What are the problems? The investigator wants to compare and make recommen¬ 
dations about treatments. But the existence of block-treatment interaction implies that 
such comparisons are not the same for all blocks. Hence making comparisons in the 
usual way, that is, by comparing treatment means, may present a wrong picture. Also, 
as Kempthorne (1952, Section 8.3) showed, with nonadditivity it is not possible to 
attach “reasonable” standard errors to treatment comparisons. And finally, nonaddi¬ 
tivity in a two-way table may be due to interaction or to nonhomogeneous variances 
as pointed out, for example, by Snee (1982) who also showed how knowledge of the 
subject matter can be important in explaining nonadditivity. And modeling such non¬ 
additivity may prove to be an important aspect of data analysis. 

It is here that the distinction between RCBDs with one blocking factor and two 
(or more) blocking factors becomes important. And beyond that, for the one blocking 
factor situation we distinguish between whether the blocking factor is a nonspecific 
(U) or an intrinsic (Z) factor. 

The reason for making this distinction is that nonadditivity for these two types of 
RCBDs may lead to different actions. Clearly, for the first type any attempt of ex¬ 
plaining or modeling nonadditivity is of no value with regard to comparing treatments. 
Rather, it may be helpful to remove such nonadditivity through a suitable transforma¬ 
tion using methods described in Section 6.10. In this case it may be useful to plot 
residuals [yik — Vi. 一 V.k + y.) against the observations to gain some insight into 
the form of nonadditivity and hence obtain an idea what type of transformation may be 
appropriate. 

With regard to RCBDs of the second type it may indeed be important to model 
possible nonadditivity as a means of interpreting differential treatment effects. In fact, 
in this case block-treatment interactions may be more important than treatment effects 
themselves. We argue that then, if at all possible, a different design should have been 
used, namely a generalized randomized block design as discussed in the next section. 
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The situation is somewhat clearer and more options may be available if we have 
more than one blocking factor. Based on the analysis of variance for models (9.73), 
(9.74) and (9.75) we have displayed F-ratios that can be used to test whether certain 
components of block-treatment interaction, namely that between intrinsic blocking fac¬ 
tor and treatments, for example, A x T and/or C x T, are present. 

To discuss this in more detail let us first consider model (9.73) in the context of 
Example 9.1. 

Example 9.1 (continued): The type of question that is often asked is: Are there 
differences between the two sites? We point out that not only is this the wrong question, 
but also that such a hypothesis cannot be tested since the factor “site” is a blocking 
factor (see Section 9.2.6). Rather, what is really meant is: Are the treatments (all or 
some) performing differently at the two sites? And this is to ask the question whether 
there exists site-treatment interaction, that is, A x T interaction. 

If the A x T interaction is significant further investigation is required to enunci¬ 
ate to what extent and in which way the full-sib families, or generally, the treatments, 
perform differently at the two sites. This can be done in what is referred to as an in¬ 
teraction plot by plotting the treatment means separately for each site (see also Section 
9.7.4). The two graphs are not essentially parallel (as they would be if interaction is not 
present), but can take on various forms. If they move essentially in the same direction 
then we have what we call codirectional interaction (see Section 9.7.4) or synergistic 
interaction (see van Belle et al., 2004). In this case it is still possible and informative 
to consider inference about the overall treatment effects either in the form of tests of 
hypotheses, including various types of comparisons, or confidence interval estimation. 
If the interaction is not codirectional then it will be more appropriate to consider infer¬ 
ence separately for each site. This can be done either in the context of model (9.73), 
that is, by using the error term from the ANOVA of Table 9.6 or or by analyzing the 
data from each site as separate RCBDs. 

Finally, if the A x T interaction is not significant then we may contemplate to 
delete the interaction term (ar)ik from model (9.73) and reanalyze the data with the 
new model. This is largely a philosophical issue, and we take the point of view that 
if deleting the term at all it should be handled in the context of a preliminary test 
(Bancroft, 1964) and be dropped only if P > .25, say. The effect of this will be, of 
course, an increase in the d.f. for error from a(c _ l)(t 一 1) to (ac — l)(t — 1). □ 

We now turn to Example 9.3 and model (9.75). 

Example 9.3 (continued): Much of what we have said for Example 9.1 holds also here. 
Instead of only one interaction term we now deal with two interaction terms and hence 
potentially two interaction plots, one for A xT and one for C xT. If both interactions 
are significant and not codirectional then we face the same problem discussed earlier 
for the case of one intrinsic blocking factor. Here, again, a generalized randomized 
block design may then be a better option to repeat this experiment. 

If one and/or the other interaction is not significant then we may contemplate pool¬ 
ing the non-significant interaction(s) with the error term in the ANOVA of Table 9.8. 
□ 

To summarize our discussion, we emphasize that it is important not to ignore the 
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possibility of block-treatment interaction, especially when intrinsic blocking factors are 
involved. In that case the interaction may be more important, and the subject-matter 
specialist may be able to provide important input and insight. And this should be 
reflected in the experimental design and the subsequent analysis. Many considerations 
come into play here, and it is impossible to provide specific directions for all cases. 

9.7 GENERALIZED RANDOMIZED 
BLOCK DESIGN 

9.7.1 Definition 

As mentioned in Sections 9.6.7 and 9.6.8 there exist situations where block-treatment 
interaction is strongly suspected a priori and where such interaction may be the major 
focus of the investigation and hence explicit characterization of its form is of utmost 
importance. This may occur, for example, when an intrinsic blocking factor is intro¬ 
duced by choice to broaden the inference from the experiment, for instance, different 
varieties of plants. The generalized randomized block design (GRBD) to be discussed 
in this section is the most appropriate design for such situations. 

We call a block design a GRBD if we have b blocks, each block containing s = rt 
EUs, such that each of the t treatments is applied to r EUs in each block (note that for 
r = 1 we have, of course, the RCBD). The treatments are assigned randomly to the 
EUs, and independent randomizations are used for different blocks. 

9.7.2 Derived Linear Model 

Let Tijk denote the conceptual response if treatment k is applied to the jth EU in the 
ith block. We can then write the following identity: 

+ (Ti.. — T..) + (Tij. — Tj..) + (T.fe — T..) 

+ (Ti.k - + f_..) + [Tijk - (9.76) 

The physical interpretation of most of the terms in (9.76) have been given in Sec¬ 
tion 9.2. In addition we now have the term 

(0 丁 、 ik = {Ti.k - Ti,) — (T./c — T..) 

the difference between the effect of treatment k in block i and the overall effect of 
treatment k. To the extent that this term is different from zero, this is a measure of 
block-treatment interaction. Also, the term 

{^ijk — T'ij) — (Ti.fc - t ..)， 

that is, the difference between the effect of treatment k on EU j in block % and the 
effect of treatment k in block i, is a measure of the unit-treatment interaction. We shall 
henceforth assume that such interaction is negligible. We can then write (9.76) as 


Tijk — /i ~h U{j "l - Tj^ -f- 


( 9 . 77 ) 
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with 

〉: A = 0 ， 〉: A = 0 

i k 

s 

Uij = 0 for every i 

j=i 

y^X0r)ik = y^ipr) ik - 0 . 

i k 


The actual experiment then consists of randomly assigning each treatment to r EUs 
in each block using independent randomizations in different blocks. The procedure, 
using SAS PROC PLAN, is illustrated in Table 9.9, where in the output the treatment 
numbers are superimposed on the unit numbers within each block. 

The randomization process is characterized by the design random variables 



if treatment k is applied to the j th EU in block i 
otherwise. 


Let yiki denote the observation for the Ith replication of treatment k in block i (i = 
1,2,..., 6; A; = 1,2,... = 1,2,... .r). We then have 


㈣ =l) =; = I 

and hence 

E 病 ) =] 

and other properties of the 6^ can be established easily. The connection between the 
conceptual response and the actually observed response is then given by 


、二 Vikl — 〉 ： 於 jTijk ， 

i=i j=i 


or 

^ Viki =r[ii + 0t 4-r/c + (/3 t) 认] + ^ 碎〜， (9.78) 

I j 

This is a model based on randomization only and does not include technical errors. If 
we add technical errors the model (9.78) becomes (see Section 9.2) 

72yikl = r[H + /3i +T k + (PT) ik ] + ^2 5 ij U ij + ^2 U ikl + ^ ikl ( 9 . 79 ) 
I j l l 


with the usual assumptions for the treatment errors, i\ki ，and observational errors, 77 ^/. 



316 


CHAPTER 9. RANDOMIZED BLOCK DESIGNS 


Table 9.9 Randomization for Generalized Randomized Block Design 


a.) Input statements: 


proc plan seed=73251; 

factors block=6 ordered units=8; 

treatments treat=8 cyclic (1 1 2 2 3 3 4 4)8; 

title I 5 RANDOMIZATION FOR GENERALIZED RANDOMIZED BLOCK DESIGN ’； 
title2’(b=6, t=4, r=2 )’； 

run; 

b.) Output: 


RANDOMIZATION FOR GENERALIZED RANDOMIZED BLOCK DESIGN 
(b=6, t=4, r=2) 


The PLAN Procedure 


Factor 

block 


Plot Factor 

Select Levels Order 

6 6 Ordered 

8 8 Random 


Treatment Factors 

Initial Block 

Factor Select Levels Order / Increment 

treat 8 8 Cyclic (11223344) / 0 


block 


■units 


■treat 
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9.7.3 The ANOVA Table 

Using the identity 

Viki = y... + (yi.. — y...) + (y.k. — y...) + {Vik. —vl. — y.k. + y.. m ) + {ym — yik.) 

and writing, based on (9.79), 

Viki = " + A + Tfc + {0r) ik + e ik i (9.80) 


we obtain the ANOVA as given in Table 9.10. The expected values of the mean squares 
as given in Table 9.10 can be obtained by using the distributional properties of the S^s 
(see Wilk, 1955) and of the i/iki and the r]iki‘ Here we have defined 



1 

6(s- 1) 


E4 


= E{^ kl ) 

^ = e (Vm)- 

We remind the reader that cfu~^ a l = ls the experimental error variance, is the 
observational error variance, and cr^ = cr^ + is the overall error variance. We note 
also that the GRBD is an unbiased design in Yates’ sense, just as the RCBD, in that 
under h = 巧 = … == 0 we have £'[MS(T)] = E[MS ( 五 ) ].Further, again 
just as in the RCBD, the ANOVA does not lend itself to a legitimate test for block ef¬ 
fects because 五 [MS(B)] # 五 [MS(E)] for Pi = P 2 = = Pb = 0- 


The form of the E(MS) in Table 9.10 suggests the following tests: 


(i) The test statistic for no block-treatment interaction, that is, Hq : [0r)ik = 0 for 
every i, k is given by 


MS{B x T) 
MS ⑹ 


(9.81) 


with (b — l)(f — 1) and bt(r — 1) d.f. 


(ii) The test statistic for no treatment differences, that is, ： T i = r 2 = • * • = = 

0 is given by 


MS(r) 

MS{E) 


(9.82) 


with t — 1 and bt(r — 1) d.f., but see Section 9.7.4 

Wilk (1955) has shown that these F-tests are reasonably good approximations to the 
corresponding randomization tests. 
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9.7A Analyzing Block-Treatment Interaction 

We shall comment briefly on the usefulness and appropriateness of testing Ho : t\ = 
• ■ * = Tt = 0 when the test for block-treatment interaction has been found significant. 
In such a case it is important to study, mainly by plotting, the nature of the interaction. 
Following the discussion in Section 9.6.8 it is useful to plot yik. versus (or versus 
i) for each k. Alternatively, we may plot y^i versus y,k. (or versus k) for each i (see 
Section 9.6.8). Two examples are given in Figures 9.1 and 9.2 based on the (fictitious) 
data in Table 9.1L 

The important feature of Figure 9.1a and 9.2a is that the changes from block to 
block, though different for the various treatments, are in the same direction. We have 
referred to this type of interaction as codirectional interaction. If we define 

丁 ik = Ti.k — Ti,, 


as the effect of treatment k in block i and 


子 ik = Vik. — Vi.. 

as its estimator (i = 1,2, … ， b; k = 1,2,... .t) then codirectional interaction implies 

that if 7 "认 _ ^ 0, then also 丁 认 1 一 tw 2 0 and ( 丁认 — 丁 j/k 、 f 、丁 — 

with similar statements for the f^. Since the “trend” for each treatment is in the same 
direction, it may make sense and, in fact, it may be quite useful to compare “the” 
effects, that is, the 

Tk = lYl nk 

i 

or, correspondingly, their estimators 

Tfe = ^ Tjfc = y./c. - y..- (9.83) 

i 

Suppose, for example, that the blocks represent breeds of cattle and the treatments 
represent feeding regimens. In order to avoid unnecessary complications let us assume 
that the data given in Table 9.1 la actually are the h representing, for example, units 
of weight gain per week. Although the difference in gain between regimens 1 and 2 is 
small for breed 1, zero for breed 2, and quite substantial for breeds 3 and 4, it seems to 
be useful information that on average the gain due to regimen 2 is 2.5 units higher than 
that from regimen 1. This is the gain a farmer would realize using regimen 2 if he had 
(equal proportions of) cattle from all four breeds. 

The picture is obviously less clear if the outcome of the experiment is represented 
by the data in Table 9.11b and presented in Figures 9.1b and 9.2b. Even though in this 
case regimen 3 is the best on average, it is clear that regimen 1 is best for breeds 3 and 
4 and regimen 3 is best for breeds 1 and 2. This form of what we might call antidirec- 
tional interaction or antagonistic interaction (see van Belle et al. ， 2004)，manifested by 
the fact that not only are the differences — r^k different for different k, but they also 
differ possibly in sign (direction), obviously dictates different action for the different 
breeds and hence makes consideration of the overall regimen effects meaningless. 
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Table 9.11 Block-Treatment Averages ( 仄 &.) for GRBD 


(a) Block 




1 

2 

3 

4 

y.k. 

Treatment 

1 

2 

6 

8 

10 

6.50 


2 

3 

6 

12 

15 

9.00 


3 

4 

8 

13 

20 

11.25 


Vi.. 

3 

6.67 

11.00 

15.00 


(b) 



Block 





1 

2 

3 

4 

y.k. 

Treatment 

1 

2 

6 

8 

10 

6.50 


2 

15 

12 

6 

3 

9.00 


3 

16 

13 

7 

4 

10.00 


Vi.. 

11.00 

10.33 

7.00 

5.67 



Comparisons of the 7 ^’s within block i are, of course, estimated by 

: (9.84) 


〉 .: Cik 千 ik 
k 


〉 : c ikVik. I 〉: 1 
k V k 


with 


and 


with 


var = J2 c ik~ 

\ k- ) k 


var 




(9.85) 


^ 1 = MSOB). 


Using = MS ( 五 ) from Table 9.10 shows that, even though we consider treatment 
comparisons (9.84) within blocks, we are using the overall (pooled) error variance 
estimate for purposes of inference. An alternative procedure is to consider the obser¬ 
vations in each block as the outcome of a CRD, say CRDi, CRD 2 , ..., CRD 5 , and 
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analyze them accordingly. This means that rather than estimating the variance of an 
estimated contrast by (9.85) we would use 


var 



(9.86) 


for i = 1, 2, .. . ， 6 , where is the error mean square from the ANOVA of CRD^ 
with t(r - 1) d.f. rather than bt(r — 1) d.f. as in (9.85). If the number of d.f. becomes 
an issue, that is, if t(r - 1) is rather small, we may consider pooling the variance 
estimators = 1 ，2 , .. . ，6 ) by using a preliminary test for 

丑0 : 4 ⑴ = 4(2) = ... = <6 )， （ 9.87) 

for example, the F-max test of Hartley (1950). If (9.87) is not rejected at, say, a = .25 
then we may choose to pool and hence return to (9.85). 

If meaningful, comparisons among the overall treatment effects are estimated by 

〉 ： Cfc 子 fc = 〉 ： ( 〉: Cfc = ◦) 

k k 

with 

var =E4 吾 

\ k / k 

and 

(9.88) 


9.7.5 A More General Formulation 

The discussion so far assumes what is usually referred to as a fixed effects model. By 
that we mean that the blocks used in the experiment are the only blocks available, 
that each block consisted of exactly tr EUs and that the treatments used are the only 
treatments of interest to the investigator. Such assumptions imply the way tests of 
hypotheses are performed as described above. It also defines the reference population 
to which the results of the experiment apply. For example, the assertion of treatment 
differences can, strictly speaking, only be made with respect to the blocks and the EUs 
that were part of the experiment. A somewhat wider reference population, however ， 
can be considered and used to derive an appropriate model for the observations from a 
GRBD and also to derive appropriate statistical tests. Following Wilk and Kempthorne 
(1955, 1956) and Zyskind (1962) we shall describe briefly such a situation without 
providing all the details. 

Suppose we have a population of B blocks, each block containing S EUs, and a 
population of T treatments. The experiment then consists of selecting at random b 
blocks, s EUs in each block, and t treatments. The selected treatments are then ran¬ 
domly assigned to the EUs in each block such that each treatment occurs r times in each 



324 


CHAPTER 9. RANDOMIZED BLOCK DESIGNS 


Table 9.12 General Forms of E(MS) for GRBD 


Source 

d.f. 

五 (MS) 


Blocks 

6-1 

+ 〜 + 'g' 

3 o T 一 t o o 

+ 了 ra 0T + tra 0 

Treatments 

t-1 

a^+al + al + 

B — b 2 , 2 

B ra 0T + rba T 

Block-Treatment In- 

(b-m-i) 

+^l + crl + 

r<7 lr 

teraction 




Error 

bt(r — 1) 

^1 + (^1 + crl 


Total 

btr — 1 




block, hence s = tr. A derived linear model can be obtained by introducing sampling 
and design random variables to link the conceptual responses and linear functions of 
them to the observed responses much in the same way as we have described the general 
idea earlier. Such a model leads to the same partitioning of the total sum of squares as 
given in Table 9.10, but it leads to different 五 (MS). The E(MS), following Wilk and 
Kempthorne (1956) are given in Table 9.12. 

Here a 2 v and are defined as before and 


a 


2 

u 



1 


B S 


EE4 


5(5-1)^^ u 

tt^)l 


1 




=1 k=l 



E.a 2 


i=l 





with the U{j , {p 丁 ) ik ，defined as before except now in the context of the popula¬ 
tions from which we sample. 


9,7.6 Random Block Effects 

We note that for the case b = B, s = S,t = T the 五 (MS) in Table 9.12 reduce to those 
of Table 9.10. Another extreme and important case is that where B is much larger than 
b (which we denote by 6 <C B), s = S，t = T, Then (B — b)/B ^ 1, and the form of 
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the 五 (MS) in Table 9.12 suggests to test the hypothesis ： n = r 2 = • ♦ 
using the F-test 

^ MS(T) 

""MS (5 X T) 



(9.89) 


with ^ — 1 and (b — l)(t — 1) d.f., which is, of course, different from (9.82). This 
situation is referred to as a mixed model (see Section 4.18) situation with block effects 
as random effects and treatment effects as fixed effects. It follows then also that 


and 



= ^A» r<7 lr)/ br 

k 

= ^2c 2 k MS(BxT)/br, 

k 


(9.90) 


where E/cCfc = 0. 

The situation t 《 T, though not inconceivable, rarely occurs in the context of 
comparative experiments. Hence we shall neither discuss the mixed model with block 
effects fixed and treatment effects random, nor the random effects model with b 《 B 
and t 《 T. 

We conclude this section with the following comments: 


(i) In practical situations it is sometimes not easy to decide whether block effects 
should be treated as random effects. For example, are the blocks in a field exper¬ 
iment randomly selected from a larger population of blocks? Most likely they 
were the only blocks available for the experiment. Or, if the experiment is repli¬ 
cated over two years (setting up a GRBD with nested blocking factors), are those 
years randomly selected? Certainly not, but still the researcher may want to con¬ 
sider them as “random” years. But if one year turns out to be a dry year and 
the other to be a wet year, then clearly we have an intrinsic blocking factor with 
fixed effects. There are obviously many variations of this discussion and thus 
this question becomes rather philosophical and often controversial. Generally, 
we prefer to consider the block effects as fixed effects. 

(ii) If the block effects are considered to be random effects, then this will affect the 
properties of the treatment least squares means, y.k.- We find from model (9.80) 
that, since E{(3i) = 0 and E[(8r)ik} = 0, 

E(y.k.) = " + (9.91) 


and 

var(j/. fc .) = (a} + a\ T + yj /6 

but, for ^ c fc = 0, 


var 



(9.92) 


(9.93) 
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or 

^ 一、 2 MS(5 x T) 

H=? Cfc —^ 

(see (9.90)). We note that it follows from (9.91), (9.92) and (9.93) that also for 
the RCBD with random block effects only var(y.fe) will be affected, that is, the 
variance of a treatment least squares mean will be larger than the corresponding 
variance for the fixed effects case (see Example 9.16 in Section 9.10). 

9.7.7 Using Satterthwaite’s Procedure 

In many situations the “truth” lies between the two extremes; that is, the effects are 
neither “fixed” nor “random” in the Eisenhart (1947) sense. The 五 (MS) in Table 9.12 
give then some idea what the proper “error” term should be for testing hypotheses. For 
example, ifb<B (but B < oo), t = T, in order to test Hq • ，丁 i = t 2 = … =Tt = 0, 
that is, Ho : = 0, we construct a synthetic mean square 

MS(i?) = 6i MS(£) + 0 2 MS(B x T) (9.94) 

such that 

E[MS (/?)] 二 EjMSp)] - rba 2 T 

that is, 

E[MS{R)\ = E[MS{T)\a 2 T = 0}. 

From Table 9.12 we infer immediately that 

01 + 02 = 1 
and 

B-b 

Hence, (9.94) becomes 

MS(i?) = 4 MS(£) 4 - MS(B x T). (9.95) 

JD JD 

To test Ho we then use a procedure due to Satterthwaite (1946) which in general can be 
described as follows: Suppose we have random variables X t (i — 1.2,..., m) which 
are independently distributed as where \Xi — E(Xi) and % is the number of 

d.f. of = 1.2, ... ,m). We then consider a random variable X defined as 

m 

^ (9.96) 

i=l 

that is, a linear combination of independent x 2 -distributed random variables, with 

m 

— \X — 〉: 



9 . 7 . GENERALIZED RANDOMIZED BLOCK DESIGN 


327 


and approximate X by a random variable of the form \x^ v jv where v is determined 
such that X and fixl/u have the same variance. It follows from (9.96) then that 


2[i 2 jv = 2 



or 

In our case we have m = 2 and X\ = MS(£ , ).X 2 = MS(B x T),X = MS(ii), 
(pi = b/B ， (j >2 = {B—b)/B, v\ — bt(r~l). v 2 = (b-l)(t-l). Since \x\ = E[MS(£')] 
and fi2 — E[MS(B x T)] are unknown, we approximate v by 



that is, 


4 MS{E) + MS(S x T) 
B B 


r b j 

2 

\B -b " 

互 MS(E) 

-+ - 

—MS(B x T) 


bt(r — 1) + (b — l)(t — 1) 


The test statistic for an approximate F-test for testing Hq is then given by 


(9.97) 


F 二 MS(T) 
~ MS{R) 


(9.98) 


with {t — 1) and u d.f. 

The synthetic “error” mean square, MS(i2), in (9.98)and as defined in (9.95) is 
also used in approximate tests concerning individual treatment contrasts S Ck 丁 k with 
E Cfe = 0. Specifically, we may use 

(¥_')/ (r.4/ br 

F = MS ⑻ 


with 1 and v [see (9.97)] d.f. as the test statistic for testing Hq : Efc Ck 丁 k = 0, 

For the overall F-test as well as for the test concerning individual contrasts the 
inference space is that of the treatments used in the experiment and the population of 
blocks available for the experiment. Probability statements associated with the tests 
are valid only over this space. One should keep this in mind when extrapolating results 
to entities not in the population. This situation does indeed occur quite often when 
we want to apply the results of the experiment to entities that will arise in the future. 
For example, if the blocks are litters of mice then the results do not automatically 
carry over to litters to be born in the future. We may feel more comfortable with 
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our extrapolation if the future litters bear some relationship to the current litters，for 
example，are obtained from the same strains. The reader will realize that applying the 
experimental results to other species, say humans, will compound the difficulties even 
further. In all of this subject matter knowledge is extremely important (see Chapters 1 
and 2). 

9.8 INCOMPLETE BLOCK DESIGNS 

9,8.1 General Notion of Designs with Incomplete 
Blocks 

In considering the RCBD and the GRBD we have assumed that the blocks contain 
enough homogeneous EUs so that each treatment can be applied once (for the RCBD) 
or r(> 1) times (for the GRBD) in each block. Since the EUs in a block are rarely, 
if ever, homogeneous, large block sizes may be associated with large unit variances, 

As a consequence, the precision of the experiment, that is，its sensitivity to detect 
treatment differences may be adversely affected. It is an empirical fact that, generally, 
smaller blocks are less heterogeneous than larger blocks. Hence the possibility of using 
“small” blocks needs to be explored and should be given consideration in designing an 
experiment. 

Often the experimenter is not given much of a choice in choosing an error-control 
design when blocks arise quite naturally with only few experimental units. As an ex¬ 
treme case, blocks of size two present themselves commonly, for example identical 
twins, half-leaves, the two sides of the body of an individual, such as both arms. Even 
litters of mice, parts of a field, or a batch of raw material may not have enough EUs to 
accommodate all treatments, especially if the number of treatments is large (examples 
of this we shall encounter when we discuss factorial experiments; see Chapter 11). 

These situations give rise to what are called incomplete blocks, and the correspond¬ 
ing error-control designs are referred to as incomplete block designs. The question of 
how one should assign the treatments to the EUs in such blocks becomes then an im¬ 
portant one. Obviously, different arrangements are possible, and some may be better 
than others. In this section we shall discuss, in general terms, several types of incom¬ 
plete block designs. This is meant as an overview to give the reader some familiarity 
with the existence and nature of such designs. Most of the technical details will be 
deferred to Chapters II. 1-6. 

The general situation is as follows: We have t treatments and b blocks; the ith 
block has fcj EUs (i — 1,2,..., 6) and the ^th treatment is replicated ri times (/ = 
1. 2,.... t). This implies obviously 

b t 

J2 k ^ = J2 ri (9.99) 

i-1 1 = 1 

and this number is denoted by n, the total number of EUs. From our earlier discussion 
it is also clear that not all treatments can occur equally often in each block, indeed in 
most situations not every treatment occurs in every block. The actual treatment-block 
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arrangement is characterized by the so-called incidence matrix N — (nu) which is a 
t x b matrix with elements 

nu = numer of times treatment l occurs in block i. 


As a consequence we have 


^nu = h (i = 1. 2,... ,6) 

i 

and 

Y2 nii = ri G = 1 ， 2, … ， t). 

i 

The arrangement of the treatments in blocks cannot be done haphazardly, since this 
may lead to undesirable properties of the design. An important property of a design 
is that of connectedness (see Section 4.12.4) which allows us to estimate all simple 
treatment comparisons of the form r\ — 丁卜 The designs described in the following 
subsections possess this property and, in addition, have some other desirable features. 

Before turning to these designs, however, we shall make a few general remarks 
about the analysis of incomplete block designs. Denoting the m-th observation for the 
Z-th treatment in the i-th block by yu m , we write, assuming unit-treatment additivity in 
the broad sense, 


yilm — + ^ilm (9.100a) 

with i = 1, 2, .. •， b; Z = 1， 2, .. •， m = 1， 2, .. .， nu , or, in matrix notation, 

y = 3 /j. + Xp/3 + X T r + e (9.100b) 

(see model (4.2) in Section 4.3,2). This model has been referred to earlier as a three- 
part linear model (see Section 4.9) or as a two-way classification model (see Sections 
4.10 and 4.13.3). 

The general form of the normal equations (NE) for estimating the effects in (9.100) 
is given in Section 4.13.3. To solve these equations it is convenient and informative 
to reduce the NE to a set of linear equations involving only the 丁 i(l = 1, 2, .. t) by 
absorbing the equations for /j, and the ft into the equations for the 77 . This leads to the 
so-called reduced normal equations (RNE) (for details see section II. 1.3). In matrix 
notation the RNE can be written as 

(R-N K- 1 NOr =T-N K 1 B, (9.101) 

where 



fn 

\ 


(k x 

\ 


T2 

0 


k 2 

0 

R = 

0 • 


and K = 

0 ' 



\ r tJ \ hj 
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Table 9.13 ANOVA for Incomplete Block Design 


Source 

d.f. 

SS* 

E(MS) 


6-1 

yBf G 2 

IT ki n 


X t |3 ， X 々 

t-1 

i ： r lQl 

1=1 

o r f Cr 

^ + t-i 

Error 

n — b — t + 1 

Difference 


Total 

n — 1 

ijm 



*G=grand total, n=total number of observations 


andf = (f 1? r 2 , r t )\ T=(TuT 2 , ..Tj ， B=(Bi, B 2 , ..B s ) / with T 《二 
yu m = Z-th treatment total, Bi = ^^yum = «-th block total. In a more succinct 

i, m L m 

form the RNE in (9.101) are generally written as 

Ct^Q (9.102) 

with C representing the coefficient matrix for r and Q the right-hand side of the RNE. 
The so-called C-matrix above is also referred to as the information matrix as it contains 
all the information concerning the properties of the underlying design. 

The general form of the analysis of variance for an incomplete block design is given 
in Table 9.13. More precisely, the sums of squares given in Table 9.13 are the sequential 
sums of squares for the ordered model (9.100a,b) (Type I SS in SAS terminology). 
This is reflected in the expressions for the sources of variation, a notation established 
in Section 4.10. 

In the terminology of block designs the sum of squares SS(X/3j3) is called the 
SS(Blocks ignoring treatments) and SS(X r |3, X^) is called SS (Treatments adjusted 
for blocks). For other details we refer to Section II.1.3.6. 

We shall now turn our attention to some specific classes of incomplete block de¬ 
signs which can be described by the specific form of C in (9.102). 

9.8.2 Balanced Incomplete Block Designs 

The balanced incomplete block design (BIBD) is an equireplicate (that is, all r/ = r), 
proper (that is, all = k), binary design (that is, nu = 1 or 0) introduced by Yates 
(1936). It has the additional and most important property that every pair of treatments 
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occurs together in a block the same number of times, this number being denoted by A. 
We shall refer to such a design as BIBD (t, 6, k, r; A) indicating thus the parameters of 
the design. For (9.99) we then have 

tr = bk = n 


and 


Tin = k for every i 
i 

J2 n u = r for every 1. 


We also have the following relationship between the parameters 

X{t - 1) = r(fc- 1). (9.103) 

The validity of this relationship can be seen as follows: Consider one treatment, say 
1*. Since l* occurs exactly A times together with the remaining t — 1 treatments, the 
number of EUs occupied by those treatments in the blocks in which r occurs must be 
q =： \{t — 1). On the other hand, since r occurs in r blocks and each block has k EUs, 
the same number q is equal to r(k — 1). 

The relationship (9.103) implies 


A — 


r(k — 1) 



(9.104) 


Since A must be an integer it is clear that a BIBD does not exist for all values of t, 
fc, and r. Even for values of t, k, and r yielding an integer A, a BIBD may not exist. 
In fact there exists only a limited number of BIBDs in the useful parameter range. A 
(incomplete) list of actual plans is given by Cochran and Cox (1957). Raghavarao 
(1971) and Mathon and Rosa (1966) provide a more complete list of parameters of 
existing designs (see also Chapter II.3.4 for a listing of BIBDs for t < 25 and k < 11). 

We give an example of a BIBD below. 


EXAMPLE 9.4: We consider BIBD (6, 10, 3, 5; 2) which can be written as follows, 
each triplet representing the treatments in a block (before randomization): 

1 2 5 2 3 4 

1 2 6 2 3 5 

1 3 4 2 4 6 

1 3 6 3 5 6 

1 4 5 4 5 6 

Equivalently, this design can be expressed in terms of the incidence matrix N as 

"1 1 1 1 1 0 0 0 0 0 " 

1 1 0 0 0 1 1 1 0 0 

_ 0 0 110 110 10 
^ = 0010110101 ' 

1 0 0 0 1 0 1 0 1 1 

0 1 0 1 0 0 0 1 1 1 
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It can be seen easily from N that pairs of Is occur exactly twice, according to 入 = 2, 
that is, = A = 2 for every pair Z, l’(l _ l f ). The way we use this design is 

to randomly assign each triplet to a block and then randomly assign the treatments to 
the EUs within a block, using independent randomizations for the b blocks. □ 

The analysis of observations from a BIBD is based on model (9.100a) which we 
now write simply as 

yu = " + A + r/ + eu (9.105) 

assuming unit-treatment additivity in the broad sense using least squares analysis and 
the ANOVA in Table 9.13 with ki — k for all i. One important point here is that if we 
rewrite (9.105) in matrix form as 

y = + X^/3 + X t t + e 

then we find for the BIBD (and the other incomplete block designs in this chapter) that 

SS(X t |3X 5 ) #5S(X t | ： I) 

(see Section 4.11). This means that for incomplete block designs model (9.105) is a 
nonorthogonal model or, expressed alternatively, incomplete block designs are non- 
ortho gonal designs. As a further consequence of the incomplete block arrangement 
comparisons among treatments can no longer be accomplished by comparing treatment 
means. Rather, the comparisons are made using LS means (see Chapters II. 1 and 2 )， 
because “adjustments” have to be made since not every treatment occurs in every block 
(this applies to all incomplete block designs). 

For the BIBD the C-matrix of (9.101) takes on the form 



(see Section II.2.4.1) with a generalized inverse (see Section 4.4.4) given by 


C" 


k 


r(k — 1) + A 

1 了 


r - - - 

k 

It then follows that for a simple treatment comparison f ； — f\> we obtain 


where, using (9,104 )， 


var(f/ — 子 I，) 



F — 1) 
_ k{t - 1) 


(9.106) 


(9.107) 

(9.108) 
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is referred to as the efficiency factor of the BIBD. It follows from (9.108) that E < 1. 
This would imply that a BIBD is less efficient than the corresponding RCBD with the 
same number of replications r. This ignores, however, the fact that usually 

2 2 
a e(BIBD) < a e(RCBD). 

Hence a fair comparison of var(f/ — t//)bibd and var(f/ — t//)rcbd would depend 
on the relationship between 4 (bibd )/ 五 an< ^ ^(rcbd)* Such information may not 
be available, in particular if a RCBD is not an available option. An estimate of a\ is 
obtained from the ANOVA table (Table 9.13) as = MS(E) = MS(I|3X / 3X r ). A test 
of the hypothesis H 0 ： r\ = T 2 = - • = r t = 0 can also be derived from the ANOVA 
table, using F = MS(X r l3 X / 3)/MS(I|3X r X / 3) with t - 1 and n - b - t 1 d.f. 

9.8.3 Balanced Treatment Incomplete Block Designs 

In many kinds of experiments it is of primary importance to compare several treat¬ 
ments with an established procedure or a control, whereas the comparisons among the 
treatments are only of secondary importance or of no importance at all. As an exam¬ 
ple consider the efficacy of several drugs as compared to a placebo. The comparisons 
among the drugs may not be important as they control different side effects. Another 
example is mentioned by Pearce (1983): In a spraying experiment with insecticides it is 
useful to leave some EUs unsprayed to provide evidence that the infestation is present. 
Of primary interest then is to see to what extent the insecticides control the pest and 
comparisons among them may only be of secondary interest. Even though BIBDs 
can be used in these situations they are usually not the most appropriate designs since 
they consider all treatments as equally important. To emphasize and take into account 
the specific situations discussed above Pearce (1960) considered designs through sup¬ 
plementation and in a more systematic and comprehensive approach Bechhofer and 
Tamhane (1981, 1983) introduced and developed balanced treatment incomplete block 
designs (BTIBD). 

Suppose we have one control treatment and t test treatments, denoted by 0, and 
l = 1 ， 2, •.. ，亡， respectively, with b blocks of size k{k < t + 1). An incomplete block 
design with these parameters is then called a BTIBD if for every l 

var(fo — n) = a 2 (jg (9.109) 

and for every pair l. ^ V) 

cov(f 0 - f/,fo - Ti>) = (9.110) 

where a and p depend on the design employed. Bechhofer and Tamhane (1981) show 
that a necessary and sufficient condition for an incomplete block design to be a BTIBD 
is that every test treatment occurs together A 0 times with the control in the same block 
and any pair of test treatments occur together Ai times in the same block. In terms 
of the elements of the incidence matrix N = (riji)(j = 0.1,2,... ,t:i = 1,2,.... 6) 
these conditions can be written as 
b 

〉 : 几 OiTHi 二 Xq (J 二 1) 2, . . . , t) 
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and 

b 

nwriyi = Ai (l, V = 1,2, 7^ V). 

i=l 

We denote such a design by BTIBD ( 亡， 6 ， /( ： ; 入 0 ，入 1).111 order for all to 一丁 = 
1,2,... ,t) to be estimable we obviously require Ao > 0. 

To illustrate this type of design we consider three examples. 

Example 9.5: BTIBD (4, 6, 3; 3, 1) is given by 

0 1 2 

0 1 3 

0 1 4 

0 2 3 

0 2 4 

0 3 4. 

We note that this design is derived by supplementing each block in the BIBD (4, 6, 2, 
3; 1) with the control 0. As a consequence 0 is replicated ro 二 6 times, whereas the 
test treatments are replicated r = 3 times. □ 

Example 9.6: BTIBD (4,7, 3; 2,2) is given by 

0 1 2 

0 1 4 

0 2 4 

0 0 3 

1 2 3 

1 3 4 

2 3 4. 

We note here that this design is not a binary design since no 4 = 2. Also, ro = 5. r = 4. 
□ 


Example 9.7: Another BTIBD (4, 7, 3; 2, 2) is given by 

0 1 3 

0 1 4 

0 2 3 

0 2 4 

1 2 3 

1 2 4 

3 4 4. 

This is a design with unequal replication for the test treatments, that is, ro = ri = 

r 2 = r 3 = 4, r 4 = 5 } where ri denotes the number of replications for treatment l. □ 
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A list of BTIBDs is provided by Bechhofer and Tamhane (1985) together with their 
properties and methods of construction. 

Least squares analysis based on model (9.100a) yields 


2 _ 尧 ( 入 o + 入 1) 
入 0( 入 0 + 亡入 1) 


for (9.109) and 


入 l 

入 o + 入 1 


for (9.110). For the designs in Examples 9.6 and 9.7 above it follows then that they 
have the same a 2 and p. These designs are said to be equivalent and considerations 
other than statistical may have to be used to decide which design to use in a practical 
situation. 


For a more extensive discussion of various types of BTIBDs see Section II.6.5. 


9.8.4 Partially Balanced Incomplete Block Designs 

We have mentioned earlier that BIBDs exist only for a limited number of parameters 
and often with a large number of blocks. To provide practical alternatives Bose and 
Nair (1939) developed a large class of incomplete block designs, referred to as par¬ 
tially balanced incomplete block designs (PBIBD). Recall that the important property 
of BIBDs is that simple treatment comparisons are estimated with the same variance. 
An obvious relaxation of this requirement is to search for designs which allow two 
types of variances for all t(t - 1)/2 simple treatment comparisons. This property is 
achieved by an important subclass of all PBIBDs, namely the so-called 2-associate 
class PBIBDs. A list of such designs is given by Clatworthy (1973) and more details 
about PBIBDs are provided in Chapters II. 4 and 5. We shall give here just one example 
of such a design to give the reader some insight into the nature of these designs. 

Example 9.8: Suppose we have t = 6 treatments and blocks of size fc = 4. From 
the list of available BIBDs we see that b = 15 blocks are needed for a BIBD with these 
parameters. Suppose we do not have 15 blocks available and hence need an alternative 
design. From Clatworthy (1973) we obtain the following design (his design SI) with 
only b = 3 blocks and r 二 2 replications for each treatment (each row represents a 
block): 

14 2 5 

2 5 3 6 

3 6 14. 

We notice, by inspection, that several pairs of treatments occur together twice in the 
same block (1 and 4, 2 and 5, 3 and 6) and the remaining pairs occur together once 
in the same block. This leads, for each treatment，to a classification of the remaining 
treatments into what are called associate classes. In this case this is done as follows: 
Write the treatments in a rectangular array 


4 
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and declare any two treatments occurring in the same row to be 1. associates and any 
two treatments not occurring in the same row to be 2. associates. This association 
scheme then leads to the following classification: 

Treatment 1. Associates 2. Associates 


1 4 2,3,5, 6 

2 5 1,3,4,6 

3 6 1,2,4,5 

4 1 2,3,5,6 

5 2 1,3,4,6 

6 3 1,2,4,5 


Any two treatments which are 1. associates, for instance, 1 and 4, occur together Ai 
times in the same block, and any two treatments which are 2. associates, for instance, 
1 and 2, occur together A 2 times in the same block. We saw already that for our design 
given above we have 入丄 = 2, A 2 = 1. If in general we write a PBIBD as PBIBD 
(t. b. k. r; Ai, A 2 ) we have in particular here a PBIBD (6, 3,4, 2; 2, 1). 

Because of the fact that Ai ^ A 2 (if Ai = A2 we would have a BIBD), we have that 
any two treatments which are 1. associates are compared with one variance, v\ say, 
and any two treatments which are 2. associates are compared with another variance, V 2 
say, for example, 

var(fi — f 4 ) = V\ 
and 

var(fi - f 2 ) = t> 2 - 

The average variance for treatment comparisons then is 


av. var = 


niVi + n 2 V2 

~ ^1 ~ 


where ni is the number of ith associates (i = 1, 2) with m + ri 2 = t — 1. Analogously 
we can define two efficiency factors E\ and E 2 by 

Vi = ( i = 1 - 2 ) 
and the overall efficiency factor E as 

niE 1 +n 2 E 2 

E = - . 

t-1 

These efficiency factors are useful for comparisons of competing designs. They are 
given together with other relevant parameters in Clatworthy 5 s (1973) tables. For our 
example we find (see Chapter II.4) Vi = ^^2 — 1.16a 】， av. var = HAg^.Ei = 
1.00, E 2 — . 86 . E = . 88 . □ 



9.8. INCOMPLETE BLOCK DESIGNS 


337 


The list of available and practically useful PBIBDs is rather extensive. Since the 
construction of PBIBDs requires in general a certain amount of mathematical machin¬ 
ery (see Chapter II.5) this list is also very convenient to have as a tool for finding 
suitable designs for a given experimental situation. As we shall discuss in later chap¬ 
ters, the notion of PBIBDs in their general form is quite fundamental in other aspects 
of experimental design as well. This includes PBIBDs with more than two associate 
classes (see also Section II.4.6). 


9.8.5 Extended Block Designs 

In the preceding sections we have considered situations in which the block size k is less 
than the number of treatments It is, of course, also possible to encounter experimental 
situations in which 7 t < fe < (7 + l)t, with 7 being a positive integer. Since the case 
7 — 1 is of particular importance and interest we shall confine ourselves to this case 
here, but extensions should be quite obvious. 

Suppose then we have t treatments, b blocks of size k with t < k <2t. It is obvious 
then that each treatment can occur in each block once but not twice. A reasonable 
approach would be to assign in each block t EUs to the t treatments and then fill in 
each block the remaining k — t EUs in an appropriate manner in concordance with the 
objective of the experiment. It is difficult to give any general rules, so we shall give 
only a few examples to illustrate the general idea: 

Example 9.9: Suppose fc = 亡 + 1. If one treatment is of special importance, for 
example, a standard with which all other treatments should be compared, then it might 
be appropriate to assign that treatment to the additional EU in each block. Such a 
design would then have the same properties as a BTIBD (see Section 9.8.3). If no 
treatment is of special importance, then having equal or nearly equal replication for 
each treatment would seem to be appropriate. □ 


EXAMPLE 9.10: Suppose t 1 < k < 2t. One possible approach is to adjoin to the 
RCBD part of the overall design one of the incomplete block designs discussed in the 
previous sections. Consider the case t = 4, k = 6. b = 6. Here we could combine 
a RCBD with a BIBD (4, 6, 2, 3; 1) so that the final design looks like this (before 
randomization) 


Block Treatments 


1 

1234 

12 

2 

1234 

13 

3 

1234 

14 

4 

1234 

23 

5 

1234 

24 

6 

1234 

34 


RCBD 

BIBD 


□ 
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Example 9.11: Suppose t = 4,/c = 7,6 = 6 and treatment 1 is of special impor¬ 
tance. One possible design then is the following (before randomization) 


Block Treatments 


1 

1 

2 

3 

4 

1 

2 

1 

2 

1 

2 

3 

4 

1 

3 

1 

3 

1 

2 

3 

4 

1 

4 

1 

4 

1 

2 

3 

4 

2 

3 

1 

5 

1 

2 

3 

4 

2 

4 

1 

6 

1 

2 

3 

4 

3 

4 

1 



RCBD 


BIBD 







BTIBD 



An important aspect of extended block designs is that they allow easy separation of 
block-treatment interaction and error if that is desirable. For Example 9.11 above the 
partitioning of the total number of d.f. in the ANOVA is as follows: 


Source 

d.f. 

Blocks 

5 

Treatments 

3 

BxT 

15 

Error 

18 

Total 

41 


The error d.f. arise, of course, from comparisons among EUs treated alike in a 
block. 

9.8.6 Some General Remarks 

The classes of designs mentioned in the previous sections represent only a fraction of 
existing incomplete block designs. We have included them here because the designs in 
these classes (i) have certain structures which can be explained quite easily, (ii) are, for 
the most part, of practical value, (iii) are easily analyzed, and (iv) serve very often as 
foundation or building blocks for other designs. 

The fact that these designs have a structure, that is, certain combinatorial and statis¬ 
tical properties, leads to a fairly easy, albeit nonorthogonal analysis. This was certainly 
the major reason for the development of these designs. With today’s computing facil¬ 
ities this is no longer of great importance, but useful nonetheless. Other designs have 
been developed which do not have the kind of structure as BIBDs or PBIBDs but which 
still have certain properties. Among those are the pairwise balanced and variance bal¬ 
anced designs (see John, 1964; Hedayat and Federer, 1974). These are for the most 
part designs with unequal block sizes, unequal numbers of replications and unequal 
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concurrences of treatments in the same block. Even though these designs possess cer¬ 
tain combinatorial properties they are much more difficult to describe in general and 
even more difficult to list. 

With respect to designs with unequal block sizes, care must be exercised in their 
use. One of the basic assumptions for the analysis of block designs is that of equal 
variances within blocks. As we mentioned earlier, the variance tends to increase as the 
block size increases. As long as the block sizes are not too unequal, this should not be 
a major problem. Situations of this type include different litter sizes, different numbers 
of leaves per plant, different block sizes due to irregular shape of experimental field, 
and so forth. 

The structure of the designs discussed ensures also that all treatment contrasts can 
be estimated, in particular all treatment differences can be estimated. We refer to such 
designs as connected designs, whereas designs which do not have this property are 
called disconnected designs (see also Section 4.12.4). To illustrate the generally unde¬ 
sirable property of disconnectedness, consider the following example. 


EXAMPLE 9.12: Consider the incomplete block design with t = 5 and 6 = 4 as given 
by its incidence matrix 

■1 1 0 0 - 
10 10 
N= 0 1 1 0 

0 0 0 1 

0 0 0 1 


We see immediately that the treatments fall into two sets such that treatments in the first 
set, {1,2,3}, do not occur together in the same block with treatments of the second set, 
{4,5}. As a consequence, functions of the form t" - n" with V = 1.2,3 and V = 4,5 
cannot be estimated unbiasedly since, using model (9.105) we cannot “eliminate” the 
block effects. Expressed differently, there does not exist a linear combination of the 
observations, Y^i^auyu say, such that its expected value is t" — 丁 i” for some V. Had 
the design instead been of the form 


N 二 


1 1 
1 0 
0 1 
0 0 
0 0 


0 0 " 
1 0 
1 1 
0 1 
0 1 


with treatment 3 occurring also in block 4 we would have a connected design. Now 
all treatment differences can be estimated as can be seen simply by looking at the 
individual blocks and the treatment comparisons estimable within them and using the 
fact that if r\ —ry and r" — 丁 i” are estimable then also 


(n - t") + (r" - 77 ") =T[- ri" 

is estimable. This brief discussion (for more detail see Chapter II.l) is meant to point 
out that the assignment of treatments to blocks should generally not be done haphaz¬ 
ardly but always with a view towards achieving connectedness. □ 
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Generally, block designs are of major importance in experimental work. Careful 
attention must be given to the availability or formation of blocks and an appropri¬ 
ate arrangement of the treatments must be selected according to the objectives of the 
experiment. Often one blocking factor is not sufficient (we have given examples in 
Section 9.6.7) and in other situations “blocking in different directions” may be called 
for. Examples of that will be discussed in the following chapter. 


9.9 SYSTEMATIC BLOCK DESIGNS 

In our discussion of the various forms of block designs we have emphasized the random 
assignment of treatments to EUs, using independent randomizations for each block. 
The reason for such random assignment is, of course, to avoid any bias in the treat¬ 
ment comparisons (see Chapter 2). There exist, however, situations where it may be 
advantageous to employ allocations of the treatments to EU’s other than random alloca¬ 
tions. One such situation presents itself if the unit contributions Uij [see (9.1)] exhibit 
some sort of smooth trend within the ith block or if the status of the EUs in a block 
change in a gradient fashion as the treatments are applied sequentially. The question 
then arises: Can we utilize this knowledge and allocate the treatments such that this 
leads to “improved” estimation of treatment comparisons vis-a-vis the situation where 
this information is being ignored? 

9.9.1 Dealing with Trends 

Considering a complete block design, we assume that we can express the trend in 
terms of a polynomial of known degree, say p, with p <t,of some characteristic of the 
EUs, denoted by x. One obvious way to proceed is to use random allocation as in the 
usual RCBD and use the information about the trend as supplementary information in 
conjunction with an analysis of covariance model. Let yij denote the observation for 
the jth EU in the ith block. Assume that the trend is the same in each block, that is, the 


trend is a function only of Xj (.7 = 1.2,..., b). We can then write 

t p 

vn = n + Pi + ^2 s ij Tk + H ^i xl j + e h 

= l / = 1 


or, more conveniently, in terms of orthogonal polynomials (see Section 7.4) 

Vij = fi + 0i + Y^ S ij T k + ^2 + e*j (9.111) 

k l 

where 二 1 if treatment k is applied to the jth EU in the zth block, and 0 other¬ 
wise. This model can be simplified even further by taking Xj = j and hence writing 
(9.111) as ' 一 

Uij = ". + A’ + + ⑽ + e h (9-112) 

k l 

We rewrite (9.112) in matrix notation as 

y = Ofi + X^/3 + X t t + X '7 + e* . 


(9.113) 
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We can analyze the data, using model (9.113)，in the usual fashion (see Section 9.4 
and Chapter 4). The important point to keep in mind here is that the sum of squares 
for treatments is of the form SS(X r |3X 7 ) as compared to SS(X T |J) for the RCBD 
without covariate, and that for model (9.113) generally 

SS(X t |3X 7 ) t^SS(X t |3). 

Cox (1951) has pointed out that if a model of the form (9.113) holds the method just 
described leads to a loss of information with regard to treatment comparisons. 

9.9.2 Trend-free Designs 

An alternative method then is to use some systematic arrangement of the treatments 
in a block. Such possibilities were considered first, although in a somewhat different 
context, by e.g., Neyman (1929) and Cox (1951, 1952). One possibility, for exam¬ 
ple is, to repeat the treatments in the same order (if the EUs are layed out along a 
line), for example, T 1 .T 2 .Ts ， T 1 . T 2 .T 3 ...or in a mirror image fashion, for exam¬ 
ple, T 2 , ? 2 , ?i ? 2i, Ti ， Ti, T 2 , T 2 for a linear trend. The idea is to construct what are 
now referred to as trend-free designs, where a design is considered to be trend-free if, 
generally speaking, the sum of squares due to treatments is not affected by the covari¬ 
ate, that is, by the model for the trend. 

For complete blocks the design is trend-free if, in our earlier notation, 

SS(X T |3X 7 ) = SS(X r ]3). 

More generally, for incomplete blocks of size k{< t) and p < k, Bradley and Yeh 
(1980) give the following definition: A block design modeled by (9.113) is trend-free 
relative to the trend in the model if 

SS(X r |3X / 3 X 7 ) = SS(X r |JX /3 ). (9.114) 

They show that a necessary and sufficient condition for a block design to be trend-free 
is that 

X ； X 7 = 0. (9.115) 

The construction of such designs is not straightforward. For the important case of a 
linear trend the existence of trend-free block designs has been shown. For the complete 
block design Yeh and Bradley (1983) prove that a necessary and sufficient condition is 
that either b is even, or both b and t are odd, with b> 3. 

Example 9.13: For t = 7 and 6 = 3 the following design is trend-free: 



Treatments 

Block 1 j 

1 

2 

3 

4 

5 

6 

7 

Block 2 j 

6 

4 

2 

7 

5 

3 

1 

Block 3 

7 

5 

3 

1 

6 

4 

2 

PiU) 

-3 

r 

—1 

! -1 

0 

1 

2 

3 
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This can be verified easily by adding for each treatment the coefficients of the orthog¬ 
onal polynomial of degree 1 (given below the design) corresponding to the positions in 
which the treatment occurs. They add to zero, hence satisfying (9.115). □ 

For the equireplicate, proper, binary block design Stufken (1988) has shown that a 
linear tread-free block design exists if and only if r(/c + 1) is even with r > 2. 

Example 9.14: The following BIBD (5, 10, 3, 6; 3) is trend-free (Bradley and Yeh, 
1980): 



Treatments 

Block 1 

1 

2 

3 

Block 2 

1 

2 

4 

Block 3 

1 

2 

5 

Block 4 

3 

4 

1 

Block 5 

3 

5 

1 

Block 6 

4 

5 

1 

Block 7 

4 

2 

3 

Block 8 

5 

2 

3 

Block 9 

5 

2 

4 

Block 10 

3 

4 

5 

PiU) 

-1 

0 

1 


The reader can verify easily that (9.115) is satisfied. □ 

It is quite obvious from this limited discussion that trend-free block designs do not 
exist for all situations and even if they do exist their construction is not obvious. For 
some construction methods and a generalization of the concept of trend-free to nearly 
trend-free designs we refer the reader to Yeh, Bradley and Notz (1985), and Bradley 
and Odeh (1988). 

The assumption that the trend is the same in each block may not always be re¬ 
alistic. Allowing for different linear trends in the blocks Jacroux et al. (1995) and 
Jacroux (1998) have provided methods of constructing efficient and optimal (see Sec¬ 
tion II. 1.13) designs. It is not surprising that many of these designs have as the basis a 
BIBD or PBIBD, as shown in the following example. 

Example 9.15: (Jacroux, 1998): For a trend-free design with t = 6, 6 = 9, fc = 3 
the following PBIBD with t = 6, 6 = 9, /c = 2: 

Block 

123456789 
Treatments 1 1 1 2 2 2 3 3 3 
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is augumented by replicating the first row to form the final design for i = 6, 6 = 9, 
fe = 3: 


Block 



1 

2 

3 

4 

5 

6 

7 

8 

9 


1 

1 

1 

2 

2 

2 

3 

3 

3 

Treatments 

4 

5 

6 

4 

5 

6 

4 

5 

6 


1 

1 

1 

2 

2 

2 

3 

3 

3 


It is easy to see how this design is trend-free with respect to possibly different linear 
trends within blocks. Obviously, it is also trend-free for a common trend, as we can 
verify easily following arguments given earlier. □ 

9.10 EXAMPLES USING SAS® 

In this section we illustrate some of the analysis procedures described in this chapter, 
with numerical examples. In each case we describe an experimental situation and give a 
data set. The analysis is carried out with the help of SAS procedures (SAS Institute, Inc. 
2002-2003), in particular SAS PROC GLM and SAS PROC MIXED. We shall make 
some comments about the input statements and the output as a link to the developments 
in this chapter. 

Example 9.16: Consider an experiment, using an RCBD, to study weight gain in 
rabbits due to five different diets: 1 = standard, 2=10 protein added, 3 二20 % protein 
added, 4 = additive A,5 = additive B. We use six litters of rabbits, each litter contain¬ 
ing five animals. The litters represent the blocks and the individual animals represent 
the EUs. The data are given in Table 9.14a. 

We use SAS PROC GLM to analyze the data. The input statements are given in 
Table 9.14a: In addition to the ANOVA we perform Tukey’s multiple comparison test 
with a = .10 and we specify a complete set of orthogonal contrasts reflecting the 
structure of the treatments. 

The results of the analysis are given in Table 9.14b, and we comment briefly on 
some aspects of the output: 

(i) Since the data set is balanced the Type I and III SS are identical and equal to 
those obtainable from Table 9.2. 

(ii) Since PROC GLM is a general linear models program that cannot distinguish be¬ 
tween data from observational or intervention studies it automatically performs 
tests of significance for all effects specified in the model statement. We should, 
therefore, ignore the 尸 -value for testing block (litter) effects. The P-value for 
diets (0.0282) indicates that there exist differences among the diets. 


(iii) Tukey’s test at a = .10 indicates a significant difference only between diet 1 and 
diet 5. 



344 


CHAPTER 9. RANDOMIZED BLOCK DESIGNS 


Table 9.14 Randomized Complete Block Design (t = 5, 6 = 6) 


a) Input statements: 

data weight; 
input diet litter gain 
datalines; 

1 1 57.0 1 2 55.0 1 3 62.1 1 4 74.5 1 5 86.7 1 6 42.0 

2 1 64.8 2 2 66.6 2 3 69.5 2 4 61.1 2 5 91.8 2 6 51.8 

3 1 70.7 3 2 59.4 3 3 64.5 3 4 74.0 3 5 78.5 3 6 55.8 

4 1 68.3 4 2 67.1 4 3 69.1 4 4 72.7 4 5 90.6 4 6 44.3 

5 1 76.0 5 2 74.5 5 3 76.5 5 4 86.6 5 5 94.7 5 6 43.2 

run; 

proc glm data=weight; 

class diet litter; 

model gain=litter diet; 

means diet/Tukey alphas. 10; 

lsmeans diet/stderr; 

contrast ’ 1 vs rest’ diet 4 -1 -1 -1 -1; 

estimate ’ 1 vs rest’ diet 4 -1 -1 -1 -I; 

contrast ’ 1 vs rest’ diet 4 -1 -1 -1 -1; 

contrast ’2+3 vs 4+5 ! diet 0 1 1-1-1; 

contrast ’2 vs 3’ diet 0 1-10 0; 

contrast ’4 vs 5’ diet 0 0 0 1 -1; 

title 1 ’RANDOMIZED COMPLETE BLOCK DESIGN (t=5 ; b=6)，； 
title2 ’ANALYSIS OF VARIANCE W/POST-HOC COMPARISONS’； 

run; 

proc mixed data-weight; 
class diet litter; 
model gain=diet; 
random litter; 
lsmeans diet; 

contrast ’I vs rest’ diet 4 - I -1-1-1; 

title2 'ASSUMING RANDOM BLOCK EFFECTS'; 

run; 

b) Output: 


RANDOMIZED COMPLETE BLOCK DESIGN ( 二 =5, b=6) 
ANALYSIS OF VARIANCE W/POST-HOC COMPARISONS 

The GLM Procedure 


Class Level Information 


Class 

Levels 

Values 


diet 

5 

12 

3 4 

5 

litter 

6 

12 

3 4 

5 6 
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Table 9.14 (Continued) 


Number of Observations Read 30 
The GLM Procedure 


Dependent Variable : gain 


Source 

DF 

Sum of 
Squares 

Mean Square 

F Value 

Pr > F 

Model 

9 

4915.622667 

546.180296 

15.54 

<.0001 

Error 

2C 

702.752000 

35.13760C 



Corrected Total 

29 

5618.374667 





R-Square Coeff Var Root MSE gain Mean 

0.874919 8.677219 5.9276S8 68.31333 


Source 

DF 

Type I SS 

Mean Square 

F 

Value 

Pr > F 

litter 

5 

4438.014667 

887•602933 


25.26 

<.0001 

diet 

4 

477.60800C 

119.402000 


3.40 

0.0282 

Source 

DF 

Type III SS 

Mean Square 


Value 

Pr > F 

litter 

5 

4438.014667 

887.602933 


25.26 

<.0001 

diet 

4 

477.608000 

119.402000 


3.40 

0.0282 


Tukey / s Student!zed Range (HSD) Test for gain 


Alpha 0.1 
Error Degrees of Freedom 20 
Error Mean Square 35.1376 
Critical Value of Studentized Range 3.73641 
Minimum Significant Difference 9.042 


Means with the same letter are not significantly different. 


Tukey Grouping Mean N diet 

A 75.250 6 5 

A 

B A 68.683 6 4 

B A 

B A 67.600 6 2 

B A 

B A 67.150 6 3 

B 
B 


62.883 


6 


1 



Pr > t 


Table 9.14 (Continued) 


Least Squares Means 

Standard 

diet gain LSMEAN Error 

1 62.8833333 2.4199725 

2 67.6000000 2.4199725 

3 67.1500000 2.4199725 

4 68.6833333 2.4199725 

5 75.2500000 2.4199725 
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•—-1 i—4 rH fi i ― I 

o o o o o 
o o o o o 
o o o o o 


Covariance Parameters 
Columns in X 
Columns in Z 


2 

6 

6 
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<.0001 

<.0001 

<•◦001 

<•0001 

<.0001 


diet 

1 

62.8833 

5.8542 

diet 

2 

67.6000 

5.8542 

diet 

3 

67.1500 

5.8542 

diet 

4 

68.6833 

5.8542 

diet 

5 

75.2500 

5.8542 


Table 9.14 {Continued) 


Iteration History 

Iteration Evaluations -2 Res Log Like 


213.05776599 

185.03378213 


Criterion 


0.00000000 


Convergence criteria met 


Covariance Parameter 
Estimates 


Estimate 


litter 

Residual 


170.49 

35.1376 


Fit Statistics 

-2 Res Log Likelihood 185. 
AIC (smaller is better) 189. 
AICC (smaller is better) 189. 
BIC (smaller is better) 188. 


Type 3 Tests of Fixed Effects 


Effect 

diet 


Num 

DF 


Den 

DF 


20 


F Value 
3.40 


Pr > F 
0.0282 


Contrasts 


Label 
1 vs rest 


Num 

DF 


Den 

DF F Value 


20 


6.29 


Pr > F 
0.0208 


Effect 


Least Squares Means 


diet 


Estimate 


Standard 

Error 


t Value 


Pr > Itl 


4 5 7 3 5 
7 5 4 7 8 

0 1112 
IX I~I i—I i — I i—I 


o o o o o 
2 2 2 2 2 
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(iv) The reader can verify easily that the sum of the contrast SSs is equal to the diet 
SS. 

(v) The tests concerning the set of orthogonal contrasts indicates that the difference 

among the diets is due mainly to the difference between diet 1 and the average 
of the new diets (P = 0.0208)，the estimated difference between the weight gains 
being 27.15/4 grams. □ 


Example 9.17: We consider the same experimental situation and data as given in 
Example 9.16，except that we now consider the litter (block) effects to be random 
effects (see Section 9.7.6). We use SAS PROC MIXED to analyze the data. The input 
statements are given in Table 9.14a and the output in Table 9.14b. 

We provide the following comments: 

(i) The variance component estimates are obtained as a\ = 170.49 and d\ — 35.14, 
the latter being the same as for the fixed effects model in Example 9.16. 

(ii) Tests of hypotheses for diets and contrasts among diets are the same as in Exam¬ 
ple 9.16. 

(iii) The LS means are the same as those in Example 9.16, but the standard errors are 
larger, 5.82 vs. 2.42 in Example 9.16, reflecting the wider inference space. □ 


Example 9.18: This example describes an experiment using an RCBD with sub¬ 
sampling. Suppose we want to compare the effect of three exercise regimens, say no 
exercise and two different forms of exercise. We have five patients (subjects) and each 
subject performs all three exercises in random order (after appropriate resting periods). 
Immediately after the exercise the blood pressure is taken twice (one measurement 
right after the other). Suppose the data for the diastolic pressure are as given in Table 
9.15a. ^ 

We use both PROC GLM and PROC MIXED. The main purpose of using PROC 
GLM is to obtain the ANOVA table as outlined in Table 9.3. 

The input statements are provided in Table 9.15a and the output in Table 9.15b. We 
make the following comments: 

(i) For both PROC GLM and PROC MIXED we have to describe the experimental 
error in technical terms, which is formally the (negligible) subject-exercise in¬ 
teraction. In addition, for PROC GLM this term has to be identified explicitly 
for any tests concerning the exercise effects, that is, overall test in the ANOVA, 
multiple comparison tests, contrast tests, as well as for obtaining the standard 
error for LS means. In PROC MIXED this will be achieved automatically by 
declaring the subject-exercise interaction as a random effect. 
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Table 9.15 Randomized Complete Block Design with Subsampling 


a) Input statements: 

data pressure; 

input subject exercise diast 

datalines; 

1 1 126 1 1 129 1 2 137 1 2 135 1 3 135 1 3 136 

2 1 134 2 1 138 2 2 140 2 2 145 2 3 141 2 3 139 

3 1 120 3 1 119 3 2 13032 13433 13033129 

4 1 137 4 1 J34 4 2 147 4 2 144 4 3 143 4 3 147 

5 1 123 5 1 123 5 2 1365 2 135 5 3 134 5 3 136 


run; 

proc glm data=pressure; 
class subject exercise; 

model diast = subject exercise suject*exercise; 
test h=exercise e=subject*exercise; 

lsmeans exercise/stderr pdiff adjust=Tukey e=subject*exercise; 

contrast ’1 vs 2+3 ’； 

exercise 2 -1 -l/e=subject*exercise; 

title 1 ’RANDOMIZED COMPLETE BLOCK DESIGN ’； 

title2 ’WITH SUBSAMPUNG ’； 

title3 ’ (t=3, b=5, n=2 )，； 

run; 

proc mixed data-pressure; 
class subject exercise; 
model diast subject exercise; 
random subject*exercise; 
lsmeans exercise/pdiff adjust=Tukey; 
contrast ’1 vs 2+3’ exercise 2-1 -1; 
estimate ’ 1 vs 2+3’ exercise 1 -.5 -.5; 
run; 

b) Output: 


RANDOMIZED COMPLETE BLOCK DESIGN 
WITH SUBSAMPLING 
(t=3, b=5; n=2) 

The GLM Procedure 


Class Level Information 


Class 


Levels Values 


subject 


1 2 3 4 5 


exercise 


3 


Number of Observations Read 30 

Number of Observations Used 30 
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Table 9.15 (Continued) 


The GLM Procedure 

Dependent Variable : diast 




Sum of 





Source 

DF 

Squares 

Mean Square 

F 

Value 

Pr > F 

Model 

14 

1541.466667 

110.104762 


28.48 

<.0001 

Error 

15 

58.000000 

3.866667 




Corrected Total 

29 

1599.466667 






R-Square 

Coeff Var 

Root MSE 

diast 

Mean 



0.963738 

1.461633 

1_ 966384 

134 

.5333 


Source 

DF 

Type I SS 

Mean Square 

F 

Value 

Pr > F 

subject 

4 

905.1333333 

226.2833333 


58.52 

<.0001 

exercise 

2 

591.2666667 

295.6333333 


76.46 

<■0001 

subject*exercise 

8 

45.0666667 

5.6333333 


1.46 

0.2522 

Source 

DF 

Type III SS 

Mean Square 

F 

Value 

Pr > F 

subject 

4 

905.1333333 

226.2833333 


58.52 

<.0001 

exercise 

2 

591.2666667 

295.6333333 


76.46 

<.0001 

subject*exercise 

8 

45.0666667 

5.6333333 


1.46 

0.2522 

Tests of Hypotheses 

Using the 

Type III MS for 

subject^exercise as an ] 

Hrror Term 

Source 

DF 

Type III SS 

Mean Square 

F 

Value 

Fr > F 

exercise 

2 

531.2666667 

295.6333333 


52.48 

<.0001 


Least Squares Means 

Adjustment for Multiple Comparisons : Tukey 

Standard Errors and Probabilities Calculated Using the Type III MS for 
subject *exercise as an 




Error Term 





Standard 


LSMEAN 

exercise 

diast LSMEAN 

Error 

Pr > 111 

Number 

1 

128.300000 

0.75C555 

<.0001 

1 

2 

138.300000 

0.750555 

<.0001 

2 

3 

137.000000 

0.750555 

<.0001 

3 
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Dependent 
Tests 
Contrast 
1 vs 2 + 3 


Table 9.15 (Continued) 


Least Squares Means for effect exercise 
Pr > it I for HO: LSMean(i)=LSMean(j) 

Dependent Variable : diast 


i/j 


<.0001 

<.0001 


2 

<•0001 

0.4726 


<•0001 

0.4726 


Variable : diast 

of Hypotheses Using the Type III MS for subject*exercise as an Error Term 


DF 

1 


Contrast SS 
582.8166667 


Mean Square 
582.8166667 


F Value 
103.46 


Pr > F 
<■0001 


The Mixed Procedure 


Model Information 


Data Set 

Dependent Variable 
Covariance Structure 
Estimation Method 
Residual Variance Method 
Fixed Effects SE Method 
Degrees of Freedom Method 


WORK.PRESSURE 
diast 

Variance Components 

REML 

Profile 

Model-Based 

Containment 


Class 

subject 

exercise 


Class Level Information 
Levels Values 


12 3 4 5 
12 3 


Iteration 


Iteration History 

Evaluations -2 Res Log Like 

： 112.23380945 

1 111.85203058 


0,00000 00'S 


Convergence criteria met. 
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Table 9.15 (Continued) 


Covariance Parameter 
Estimates 


Cov Parm Estimate 

subject*exercise 0.8833 

Residual 3.8667 

Type 3 Tests cf Fixed Effects 

Num Den 

DF DF F Value Pr > F 

4 8 4C.17 <.0001 

2 8 52.48 <.0C0I 

Estimates 
Standard 


Label 

Estimate 


Error 

DF t 

Value 

Pr > 1u | 

1 vs 2+3 

-9.350C 


0.9192 

8 

-10.17 

<.OOC1 




Contrasts 






Nurn 

Den 





Label 

DF 

DF 

F Value 

Pr > F 



1 vs 2+3 

1 

8 

103.46 

<.0001 



Effect 

subject 

exercise 


Least Squares Means 


Effect 

exercise 

Estimate 

Standard 

Error 

DF 

t Value 

Pr > 11 | 

exercise 

1 

128.30 

0.7506 

8 

170.94 

<.0001 

exercise 

2 

138.30 

0.7506 

8 

184.26 

<.0001 

exercise 

3 

137.00 

0.7506 

8 

182.53 

< ■ 0C01 


Differences of Least Squares Means 
Standard 

Differences of Least Squares Means 


Zf fee- exercise 一 exercise Estimate Xrror DF t Value ?r > 1 1 1 Adjustrr.sn: Ad; ? 


exercise 

1 

2 

-1C.0000 

1.0614 

8 

-9.42 

<.0001 

Tukey 

<.0001 

exercise 

1 

3 

-8.7000 

1.0614 

8 

-8.20 

<.0001 

Tukey 

<.0001 

exercise 

2 

3 

1.3000 

1.0614 

8 

1.22 

0.2555 

Tukey 

0.472 
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(ii) The output shows that there exist significant differences among the types of exer¬ 
cise (P < .00010), but that exercises 2 and 3 are not significantly different from 
each other (P = .47). 

(iii) The standard errors for the LS means are the same for both PROC GLM and 
PROC MIXED, as they should be for balanced data (they may be different for 
unbalanced data due to difference estimation procedures for estimating variance 
components; see Section II. 1.11.2). 

(iv) The estimates for o 2 e and are given by PROC MIXED as g\ = 0.8833 and 

dl = 3.8667. □ 

Example 9.19: The setting for this experiment using an RCBD with a nested block¬ 
ing structure and subsampling is described in Example 9.1 (we have changed the num¬ 
bers of half-sib families (HSF) and blocks per location in order to save space). Suppose 
we have obtained the data (height in cm) given in Table 9.16. 

In order to obtain the ANOVA table given in Table 9.6.8 we use SAS PROC GLM. 
The input statements and the results are given in Table 9.16. We also illustrate how to 
use SAS PROC MIXED with input statements given in Table 9.16a and the output in 
Table 9.16b. We make the following comments: 

(i) For PROC GLM we have to provide a technical expression for the experimental 
error, which in this case is equal to the HSF x block (location) interaction. This 
term, that is, the corresponding MS is used to test hypotheses about HSF and 
HSF x location interaction. 

(ii) In PROC MIXED HSFxblock(loc) is considered to be a random effect, and cor¬ 
rect tests about HSF and HSF x location interaction are performed automatically. 
Both tests are significant (P< 0.0001). A look at the locxHFS LS means shows 
that the interaction is codirectional. Hence the test about HSF (averaged over 
locations) seems appropriate. 

(iii) The SLICE option in the LS means input statement is one way to investigate a 

significant interaction, in particular, if the interaction turns out to be antidirec- 
tional. We have included it here to indicate this option to perform the ANOVA 
separately for each location and provide the F-test for HSF for each location. 
Note that Denominator DF=12 indicates that the pooled experimental error has 
been used. □ 


EXAMPLE 9.20: We consider here a fertilizer study involving two small grain vari¬ 
eties. The fertilizer is nitrogen (N) at five increasing levels (by the same amount). The 
field experiment is laid out as a GRBD, with the varieties representing the blocks, and 
each nitrogen level being applied to two EUs for each variety, that is, we have ^ = 5, 
fo = 2, r = 2. Suppose we obtain the yield data given in Table 9.17a. 

We use SAS PROC GLM to analyze the data. The input statements and the output 
are given in Table 9.17. We make the following comments: 



run; 

proc glm data=pine; 
class loc block HSF; 

model height=loc block(loc) HSF loc*block(loc); 

test h=exercise e=subject*exercise; 

title 1 ’RCBD WITH NESTED BLOCKING STRUCTURE'; 

title2 ’AND SUBSAMPLING ，； 

title3 ’ [t=3, b=8 (A-2, C-4), n=2 ]，； 

run; 

proc mixed data=pine; 
class loc block HSF; 

model height=loc block(loc) HSF loc*HSF; 
random HSF*block(loc); 
lsmeans HSF loc*HSF/ slice=loc; 

run; 

b) Output: 


RCBD WITH NESTED BLOCKING STRUCTURE 
AND SUBSAMPLING 
[t=3, b=8 (a=2,c=4>, n=2] 

The GLM Procedure 

Class Level Information 


Class 

Levels 

Values 

loc 

2 

1 

2 

block 

4 

1 

2 3 

HSF 

3 

1 

2 3 


Number of Observations Read 48 

Number of Observations Used 48 
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Table 9.16 RCBD with Nested Blocking Structure and Subsampling 

a) Input statements: 


00469050 
91017909 
12221121 
33333333 
12341234 
11112222 
71512372 
90018899 
12 2 2 1 . —- 1 . — . 


11112222 
01733154 
67789090 
22221212 
22222222 
,， 12341234 
@ 11112222 
@ 25101807 
t 56779909 
ht22221121 
22222222 
•le12341234 
1111222 2 
5F14515432 
0 21237889 
^ 22221111 
11 11 - - - 1L * - - * 11 1L 

0C 12341234 
5bl 11112222 
lecs;02048094 
• ss el2227888 
PI ^22221111 

a^I^ 11 11 li 11 11 IX 11 

atpat12341234 
dlndllll 2222 
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loc 

block(loc) 

HSF 

loc*HSF 
block*HSF(loc) 


1 

6 

2 

2 

12 


20336.33333 
1462.33333 
12x70.66667 
6511.16667 
301.16667 


20336.33333 

243.72222 

6085.33333 

3255.58333 

25.09722 


63 <.0001 
06 <.0001 
08 <.0001 
70 <.0001 
14 0,3763 


Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

loc 

1 

20336.33333 

20336.33333 

922 

• 63 

<. 0001 

block(loc) 

6 

1462.33333 

243.72222 

11 

• 06 

<, 0001 

HSF 

2 

12170.66667 

6085.33333 

276 

.08 

<. 0001 

Ioc*HSF 

2 

6511.16667 

3255.58333 

147 

.70 

<• 0001 

b!ock*HSF(loc) 

12 

301.16667 

25.09722 

1 

.14 

G.3769 


Tests 

of Hypotheses Using the Type III MS 

for block*HSF(loc) as an 

Error Term 

Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

HSF 

2 

12170.66667 

6085.33333 

242.47 

<.0001 

loc*HSF 

2 

6511.16667 

3255.58333 

129.72 

<■0001 


The Mixed Procedure 

Covariance Parameter 
Estimates 

Cov Parm Estimate 


Table 9.16 (Continued) 

The GLM Procedure 


Dependent Variable : height 


Source 

DF 

Sum of 
Squares 

Mean Square 

F Value 

Model 

23 

40781.66667 

1773.11594 

80.44 

Error 

24 

529.00000 

22.04167 


Corrected Total 

47 

41310.66667 




Pr > F 
<.0001 


R-Square 

0.987195 


Coeff Var 
2.228571 


Root MSE 
4.694855 


height Mean 
210.6667 


DF 


Type 工 SS 


Mean Square F Value Pr > F 


16 7 

17 4 


block*HSF(loc) 
Residual 


1.5278 

22.0417 
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81C.30 <.0001 

S.7 ： 0.C005 

242.47 <.0001 

129.72 <.00CI 


loc 

block(loc) 
HSF 

loc*HSF 


Effect 

loc 

HSF 

Least 

Estimate 

Squares Means 

Standard 

Error 

DF 

t Value 

?r > |t; 

HSF 


1 

202.00 

1.2524 

12 

161 

.29 

<•0001 

HSF 


2 

233.00 

1.2524 

12 

186 

.04 

<.0001 

HSF 


3 

197.CO 

1.2524 

12 

：57 

.25 

<.0001 

loc*HSF 

1 

1 

220.88 

1.7712 

12 

124 

• 70 

<.0001 

loc*HSF 

1 

2 

268.63 

1.7712 

12 

151 

.66 

<.0CC1 

10C*HSF 

1 

3 

204.25 

1.7712 

12 

115 

• 32 

<.0001 

loc*HSF 

2 

1 

183.13 

1.7712 

12 

103 

.39 

<. OC-Gl 

loc*HSF 

2 

2 

197.38 

1.7712 

12 

111 

.44 

<.0001 

loc^HSF 

2 

3 

189.75 

1.7712 

12 

107 

■ 13 

<.0001 



Tests 

of 

Effect 

Slices 




Num 

Den 



Effect 

loc 

DF 

DF 

F Value 

Pr > F 

loc*HSF 

1 

2 

12 

355.98 

<.C001 

loc*HSF 

2 

2 

12 

16.21 

0.0004 


(i) The ANOVA is given as outlined in Table 9.10. The N effects are significantly 
different (P< .0001) and there is also significant variety xN interaction (P< 
.0072). 

(ii) Since we have quantitative treatments here, a post-hoc analysis should be a trend 
analysis. For the overall N-levels we find a significant linear and quadratic trend 
(P= .0006 and P< .0001 ， respectively), and the N LS means indicate that level 
3 provides the highest yield at 149.0. 

(iii) However, since the varxN interaction is significant, it seems appropriate to per¬ 

form the trend analysis separately for each variety. The results show essentially 
the same trends as in (ii), but the linear trend for variety 2 is not significant 
(P< .3325). As a result, the highest yields occur for different levels of N，namely 
level 4 (at 150.0) for variety 1 and level 2 (at 159.0) for variety 2. □ 


Table 9.16 (Continued) 


Type 3 Tests of Fixed Effects 


Ef f ec 


Num 

DF 


Den 

D? 


F Value 


Pr > F 


2 2 2 2 
^ —- Ii i — I 
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Table 9.17 Generalized Randomized Block Design 


a) Input statements: 
data fert; 

input N var y @ @; 
datalines; 

11 104 1 1 114 1 2 109 1 2 124 

2 1134 2 1 130 2 2 154 2 2 164 

3 1 146 3 1 142 3 2 152 3 2 156 

4 1 150 4 I 15042 14042 135 

5 1 133 5 1 1465 2 131 52 137 

run; 

proc glm data=fert; 
class var N; 
model y=var N var*N; 
lsmeans N var*N/stderr; 
contrast 'N-linear* N -2 - I 0 J 2; 
contrast ’N-quad’ N 2 -1 -2-12; 

contrast 'N-linear varP N-2-1 0 1 2 var*N -2-1 0 1 20 0 0 0 0; 
contrast ’N-quad varl’ N 2 -1 -2 -1 2 var*N -2-10 1 200000; 
contrast 'N-linear var2’ N-2-1 0 1 2 var*N 0 0 0 0 0-2-1 0 1 2; 
contrast ’N-quad var2’ N 2 -1 -2-12 var*N 0 0 0 0 0 2-1 -2-1 2; 
title 1 ’GENERALIZED RANDOMIZED BLOCK DESIGN ’； 
title2 ’ (t=5,b=2 r=2 )’ ； 

title3 ’ ANOVA WITH POST-HOC TREND ANALYSIS'; 

run; 

b) Output: 


GENERALIZED RANDOMIZED BLOCK DESIGN 
(t=5, b=2 r=2) 

ANOVA WITH POST-HOC TREND ANALYSIS 
The GLM Procedure 
Class Level Information 


Class Levels Values 

var 2 12 


N 


Number of Observations Read 20 

Number cf Observations Used 20 
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2.964372 <.0001 

2.964372 <.0001 

2.964372 <.0001 

2.964372 <.0001 

2.964372 <.0001 


var N y LSMEAN 

1 1 109.000000 

i 2 132.000000 

1 3 144.000000 

1 4 150.000000 

1 5 139.500000 

21 116.500000 

22 159.000000 

23 154.000000 

24 137.500000 

25 134.000000 


Standard 

Error 

Pr > It! 

4.192255 

<.0001 

4.192255 

<.0001 

4.192255 

<.0001 

4.192255 

<.0001 

4.192255 

<.0001 

4.192255 

<.0001 

4 .192255 

<.0001 

4.192255 

<.0001 

4.192255 

<.0001 

4,192255 

<.0001 


Variable : y 


Table 9.17 (Continued) 

The GLM Procedure 


ted Total 


DF 

9 

10 

19 


Sum of 
Squares 

4465.450000 

351.500C00 

4816.950000 


Mean Square 
496.161111 
35.150000 


F Value 
14.12 


Pr > F 

0.0001 


R-Square Cceff Var Root MSE y Mean 

0.927C29 4.310246 5.S28744 137.5500 


DF 

1 

4 

4 


Type I SS 

Mean Square 

F 

Value 

Pr > F 

140.450000 

140.450000 


4.00 

0.0735 

3393.700000 

848.425000 


24.14 

<.0001 

931.300000 

232.825000 


6.62 

0.0072 

Type III SS 

Mean Square 

F 

Value 

Pr > F 

140.450000 

140.450000 


4.00 

G.0735 

3393.700000 

848.425000 


24.14 

<.0001 

931.300000 

232.825000 


6.62 

C.0072 


Least Squares Means 
Standard 


y LSMEAN 


Error 


Pr > it I 


Dependent 

Source 

Model 

Error 

Correc 


var 

N 

var »N 


var 

N 

var *N 


o o o o o 
o o o o o 
o o o o o 
o o o c o 

5 0 0 5 5 
7 5 0 7 7 


14 4 4 3 

i 1 i — I rH I — I 
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Table 9.17 (Continued) 


Dependent Variable : y 


Contrast 

DF 

Contrast SS 

Mean Square 

F Value 

Pr > F 

N-linear 

1 

855.625000 

855.625000 

24.34 

0 _ 0006 

N-quad 

1 

2225.160714 

2225.160714 

63.30 

<.0001 

N-linear varl 

1 

1248.200000 

1248.200000 

35.51 

0.0001 

N-quad varl 

1 

761.285714 

761.285714 

21.66 

0.0009 

N-linear var2 

1 

36.450000 

36.450000 

1.04 

0.3325 

N-quad var2 

1 

1530.321429 

1530.321429 

43.54 

<•0001 


Example 9.21: This example is intended to illustrate some features of the analysis 
of incomplete block designs, in particular the balanced incomplete block design. We 
consider here the BIBD (3, 3, 2, 2; 1) (which is not particularly useful from a practical 
point of view) with the data given in Table 9.18a. 

We comment below on the input statements given in Table 9.18a and the output 
contained in Table 9.18b: 

(i) As options in the model statement we include “inverse” and “solution”. The 7x7 
inverse given in the output is, of course, a g-inverse of the coefficient matrix of 
the NE. It is obtained by imposing the conditions (3s = 0, ?3 = 0. This is also 
reflected in the vector of solutions obtained in this way. For example, we have 

— 16.0, ?2 = 3.0, % = 0, so that we can obtain 

ri - r 2 = 16.0-3.0= 13.0 
fi - r 3 = 16.0 - 0=16.0 

f 2 - f 3 = 3.0 - 0= 3.0 

as the estimates of treatment differences. We emphasize here that the solution 
vector can only be used to obtain estimates of estimable functions, in particular 
treatment contrasts. 

(ii) The variance of the estimate of a treatment contrast can be obtained from the 
g-inverse matrix (see Section 4.16.2) as follows: Consider the 3 x 3 sub-matrix 
corresponding to the treatment effects 

/1.333 0.667 0 、 

0.667 1.333 0 

\ 0 0 0 / 

We then obtain, for example, 

var(fi - f 2 ) = (1.333 —2-0.667+ 1.333 ) 的 
=1.333 MS ⑻ 

=1.333 • 1.5 = 2 
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From this we obtain the standard error (se) as 

se(ri — 5) = v^2 = 1.414 

which agrees with the value given in the output. 

(iii) Looking at all possible treatment differences we see also that they all have the 
same standard error (1.414), a property of the BIBD. 

(iv) In the LSmeans statement we have included the option “e”. The resulting “Co- 
efficients for trt Least Square Means” tells us how the LS mean for a particular 
treatment is obtained from the solution vector (since this is no longer simply the 
treatment mean). We illustrate this with the following example for treatment 1: 


LS mean (Ti) = 1 • jS + .333(3i + 爲 + 爲 ) + 1 • fi 
=19.5 + .333(—12.0 — 6.0 - 0) + 16.0 
=19.5-6.0+16.0 = 29.5 

The g-inverse can then be used, as illustrated earlier, to obtain the standard error 
for the LS means. □ 


Example 9.22: This is a continuation of Example 9.21 to illustrate the use of the 
reduced normal equation (RNE) in analyzing data from an incomplete block design. 

In Table 9.19a we give the input statements, and Table 9.19b contains the output. 
We comment on both briefly as follows: 


(i) The main feature in the input statement is the “absorb block” statement. This 
results in absorbing the equations for [x and ， / ?i, 02 , 03 into the equations for ri, 
r 2 , T 3 to obtain the RNE. 

(ii) The C-matrix (see (9.101)) can be displayed by including the “xpx” statement, 
and the “inverse” statement displays a g-inverse C _ , obtained by imposing the 
condition = 0. Note that this g-inverse is different from the expression given 
in (9.105) which is obtained by imposing the condition fi H- r 2 + 73 = 0. But for 
both choices of a g-inverse estimates for treatment contrasts will be identical. 

(iii) The result of the test for Ho : ri = 巧 =T 3 二 0 is the same as that given in 

Table 9.18b. — 

(iv) By using the “absorb” option we cannot ask for treatment LS means because for 

this we need the solutions jS ， 0i, 02 , fh which are not available now. □ 
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Table 9.18 Balanced Incomplete Block Design 


a) Input statements: 

data BIBD; 
input block trt y; 
datalines; 

1 1 24 
1 2 10 

2 1 29 

2 3 14 

3 2 23 
3 3 19 

run; 

proc glm data: B IB D; 
class block trt: 

model y=block trt/ inverse solution; 
lsmeans trt/stderr e; 
estimate *1 vs T trt I -1 0; 
estimate ’ 1 vs 3’ trt 1 0 -1; 
estimate ’ 1 vs 3’ trt 0 1 -1; 

title 1 ’BALANCED INCOMPLETE BLOCK DESIGN’; 
title2 ’ (t-3, b=3, k=2, r=2, lambda-1 )’； 
title3 ’ANALYSIS OF VARIANCE ’； 
title4 ’W/POST-HOC ANALYSIS ’； 

run; 

b) Output: 


BALANCED INCOMPLETE BLOCK DESIGN 

(t=3, b=3, k=2, r=2, larr.bda=l) 
ANALYSIS OF VARIANCE 
W/POST-HOC ANALYSIS 

The GLM Procedure 

Class Level Information 

Class Levels Values 

block 3 123 

trt 3 123 


Number of Observations Read 
Number of Observations Used 


6 

6 
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Table 9.18 {Continued) 

The GLM Procedure 


X 7 X Generalized Inverse (g2) 



Intercept 

block 1 

block 2 

block 3 

Intercept 

0.8333333333 

-0.333333333 

-0.666666667 

0 

block 1 

-0.333333333 

1.3333333333 

0.6666666667 

0 

block 2 

-0.666666667 

0.6666666667 

1.3333333333 

0 

block 3 

0 

0 

0 

0 

trt 1 

-0.333333333 

-0.666666667 

-0.333333333 

0 

trt 2 

-0.666666667 

-0.333333333 

0.3333333333 

0 

trt 3 

y 

0 

19,5 

- 12 ° 

0 

-6 

0 

0 


X Generalized Inverse 

(g2) 



trt 1 

trt 2 

trt 3 

y 

Intercept 

-0.333333333 

-0.666666667 

0 

19.5 

block 1 

-0.666666667 

-0.333333333 

0 

-12 

block 2 

-0.333333333 

0.3333333333 

0 

-6 

block 3 

0 

0 

0 

0 

trt 1 

1.3333333333 

0.6666666667 

0 

16 

trt 2 

0•6666666667 

1.3333333333 

0 

3 

trt 3 

0 

0 

0 

0 

y 

16 

3 

0 

1.5 


Dependent Variable : y 





Sum of 





Source 


DF 

Squares 

Mean Square 

F 

Value 

Pr > F 

Model 


4 

241.3333333 

60.3333333 


40.22 

0.1176 

Error 


1 

1.5000000 

1.5000000 




Corrected 

Total 

5 

242.8333333 






R-Square 

Coeff Var Root 

MSE y 

Mean 



0.993823 

6. 

,175184 i .224745 19. 

83333 


Source 


DF 

Type I SS 

Mean Square 

F 

Value 

Pr > F 

block 


2 

24.3333333 

12.1666667 


8.11 

0.2410 

trt 


2 

217.0000000 

108.5000000 


72.33 

0.0829 

Source 


DF 

Type III SS 

Mean Square 

F 

Value 

Pr > F 

block 


2 

108.0000000 

54.0000000 


36.00 

0.1170 

trt 


2 

217.0000000 

108.5000000 


72.33 

0.0829 
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0■0690 
0.0561 
0.2804 


1 vs 2 13.00CO000 1.41421356 

1 vs 3 16.0C-OOCOO 1.41421356 

2 vs 3 3.0000000 1.41421356 


Table 9.18 (Continued) 




Standard 



Parameter 

Estimate 

Error 

t Value 

Pr > |t i 

Intercept 

19.500000C0 B 

1.11803399 

17.44 

0.C365 

block 1 

-12.00000000 B 

1.41421356 

-8.49 

0.0747 

block 2 

-6.00000000 B 

1.41421356 

-4.24 

0.1474 

block 3 

0.00000000 3 




trt 1 

16.00000000 B 

1.41421356 

11.31 

0.0561 

trt 2 

3.00000000 3 

1.41421356 

2.12 

0.2804 

trt 3 

O.OOOOOCOO B 





NOTE : The X r X matrix has been found to be singular, and a 
generalized inverse 

was used to solve the normal equations. Terms whose estimates are 
followed by the letter ' B y are not uniquely estimable. 

Least Squares Means 

Coefficients for trt Least Square Means 


Effect 


trt Level 

1 

2 

3 

Intercept 


1 

i 

1 

block 

1 

0.33333333 

C.33333333 

0.33333333 

block 

2 

0.33333333 

0.33333333 

0.33333333 

block 

3 

0.33333333 

0.33333333 

0.33333333 

trt 

1 

i 

0 

0 

trt 

2 

C 

丄 

0 

trt 

3 

0 

C 

1 


Standard 

trt y LSMEAN Error Pr > It I 

1 29.5000000 0.9574271 0.0207 

2 16.5000000 0.9574271 0.C369 

3 13.5000000 0.3574271 0.0451 


Dependent Variable : y 

Standard 

Parameter Estimate Error t Value Pr > ;ti 


9 12 
13 1 





364 


CHAPTER 9. RANDOMIZED BLOCK DESIGNS 


Table 9.19 Balanced Incomplete Block Design (RNE) 


a) Input statements: 


data BIBD; 
input block trt y; 
datalines; 

1 1 24 
1 2 10 

2 1 29 

2 3 14 

3 2 23 
33 19 

run; 

proc glm data=BIBD; 
absorb block; 

model y=trt/xpx inverse solution: 

title 1 'BALANCED INCOMPLETE BLOCK DESIGN’; 

title2 ’ (t-3, b=3, k-2, r=2, lambda-1 )’； 

title3 ’USING REDUCED NORMAL EQUATIONS'; 

run; 

b) Output: 


BALANCED INCOMPLETE BLOCK DESIGN 
(t=3, b=3, k=2, r=2, larr.bda=I) 
USING REDUCED NORMAL EQUATIONS 

The GLM Procedure 
Class Level Information 
Class Levels Values 

trt 3 123 


trt 1 
irt 2 
trt 3 
y 


Number of Observations Read 6 

Number of Observations Used 6 


The X'X Matrix 

zrt 1 trt 2 trt 3 

1 -0.5 -0.5 

-0.5 1 -0.5 

-0.5 -0.5 1 

14.5 -5 -9.5 


y 



X’X Generalized Inverse (g2) 


trt 1 trt 2 trt 3 


y 


trt 1 
trt 2 
trt 3 


1.3333333333 

0.6666666667 

0 


0.6666666667 

1.3333333333 

0 


16 
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Table 9.19 (Continued) 


Dependent 

Variable : y 


Sum cf 





Source 


DF 

Squares 

Mean Square 

F 

Value 

Pr > F 

Xcdel 


4 

24:.3333333 

60.3333333 


40.22 

0.1176 

Error 


1 

1.5C000C0 

1.500000C 




Corrected 

Total 

5 

242.8333333 






R-Square 

Coeff Var Root 

MSE y 

Mean 



0.993823 

6. 

,175184 1.224745 19.83333 



Source 


DF 

Type I SS 

Mean Square 

F 

Value 

Pr > F 

block 


2 

24.3333333 

12.1666667 


8.11 

C .2410 

trt 


2 

217.0000000 

108.5000000 


72.33 

0.0829 

Source 


DF 

Type III SS 

Mean Square 

F 

Value 

Pr > F 

trt 


2 

217.QCOOOOO 

108.5000000 


72.33 

0.0829 





Standard 



Parameter 


Estimate 

Error 

z Value 

?r > |t! 

trt 

1 

16.COOCOOOO B 

1.41421356 

11.3 ： 

0.0561 

trt 

2 

3.00000COO E 

1.41421356 

2.12 

0.2804 

trt 

3 

0.00000000 B 





NOTE : The X matrix has been found to be singular, and a generalized inverse 
was used co solve the normal equations. Terms whose estimates are 
followed by the letter ' are not uniquely estimable. 
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9.11 EXERCISES 

9.1 Consider the following data from an experiment testing the effects of 5 levels 
of application of potash on the Pressley strength index of cotton (John and Que- 
nouille, 1977) 一 



Treatments* 

1 

2 

3 

4 

5 

Block 1 

7.62 

8.14 

7.76 

7.17 

7.46 

Block 2 

8.00 

8.15 

7.73 

7.57 

7.68 

Block 3 

7.93 

7.87 

7,74 

7.80 

7.21 


* Pounds KqO per acre, expressed as units. 


(i) Obtain the ANOVA table and test the hypothesis that there are no differ¬ 
ences among the treatments. 

(ii) Since the treatments are quantitative, rather than making comparisons be¬ 
tween individual treatments, it is preferable to explore the response curve. 
Partition the treatment sum of squares into three components due to lin¬ 
ear effects, quadratic effects, and remainder. Test for linear and quadratic 
effects. 

(iii) Suppose the actual levels of the treatments are 36, 54, 72 ， 108, and 144 
pounds K 2 0 per acre, respectively. Partition the treatment sum of squares 
into a component due to linear response and a component due to deviation 
from linear response. Test for linearity. 

(iv) Find the relative efficiency of this design and interpret the result. 

9.2 Consider an experiment with 5 treatments in a RCB design with 10 blocks. The 
partial ANOVA table is as follows: 


Source d.f. SS MS 


Blocks 135 

Treatments 100 

Residual 


Total 


307 


(i) Complete the ANOVA table above. 

(ii) Give the test statistic for testing Hq : t\ ~ T 2 = tz = — 0. 

(iii) Suppose a preplanned comparison of treatments is that of comparing treat¬ 
ment 1 against the average of treatments 2 and 3. Give the estimated vari¬ 
ance for the estimate of this treatment comparison. 
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(iv) Suppose you have supplementary information in the form of a variate x. 
Performing an analysis of covariance you obtain (among other quantities) 

^2(yik - Vi. - y.k+y..)( x ik - s：i. -x, k + x..) = —20 

i，k 

— — T.fc + 尤 ..)2 = 10 

i y k 

Obtain the estimate of 

(v) Under the analysis of covariance model what is the estimator for the con¬ 
trast in (iii) and its estimated variance (use numerical values where possible 
based on the information provided in this problem). 

9.3 A chemist wants to compare three treatments. The experimental material he 
plans to use comes from four different manufacturers. He expects systematic 
differences among the material from the different manufacturers. Moreover, he 
is interested in finding out whether differences between treatment effects depend 
on the manufacturer. There is sufficient experimental material from each manu¬ 
facturers for 12 experimental units. 

(i) What experimental design should he use for his experiment? 

(ii) Suppose he comes to you with the following partial ANOVA table: 

Source d.f. SS MS 


Manufacturers (M) 20 

Treatments(T) 25 

M xT 5 

Error 2,3 


Total 


Fill in the ANOVA table. 

(iii) Give the SAS statements (classes, model) for the ANOVA in (ii). 

(iv) The chemist claims that he has used four replications per treatment, per 
manufacturer. After some questioning you find out that the four “repli- 
cations” are actually two replications and two duplicate measurements for 
each experimental unit. In this case what should the ANOVA really look 
like? Give sources of variation and d.f. 

(v) Give the SAS statements (classes, model) for the ANOVA in (iv). 

(vi) Unfortunately, the computer file with his original data was destroyed. In 
order to correct the situation and come up with a reasonable ANOVA table, 
you devise a small experiment which will allow you to obtain an estimate 
of the sampling (observational) error variance component. Suppose this 
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estimate equals 1. Assuming that this is a reasonable estimate for the actual 
experiment, that is, assuming that this value was obtained from the actual 
experiment, complete now the ANOVA table given in (iv) by filling in d.f” 
SSs and MSs. 

(vii) Based on the ANOVA in (vi), give the test statistic for testing the null hy¬ 
pothesis that there is no M x T interaction. 

(viii) Based on the ANOVA in (vi), give the standard error for a simple treatment 
comparison (that is, the differences between the estimates of two treatment 
effects). 

9.4 In a study of reaction time under the influence of alcohol, age is thought to 
be another factor that could affect the time. Test subjects (individuals) were 
classified into three age groups: 20-39, 40-59, 60 and over. In each age group 
each treatment (0 oz., 1 oz” 2 oz.) was randomly assigned to 4 individuals. The 
following results were obtained (the reaction time is measured in seconds): 



Amount of Alcohol 

Age 

0 oz. 

1 oz. 

2 oz. 

20-39 

.42 

•49 

.65 


.45 

.47 

.60 


.39 

.46 

.70 


.40 

.51 

.66 

40—59 

.51 

•70 

1.05 


.55 

.69 

1.10 


.53 

.73 

.98 


.50 

.75 

1.12 

60 and Over 

•60 

.85 

1.25 


•59 

.79 

1.20 


.58 

.88 

1.30 


.61 

,90 

1.27 


(i) What kind of experimental design has been used? 

(ii) Give the appropriate ANOVA table. 

(iii) Investigate whether the differences in reaction time for the different amounts 
of alcohol depend on the age group. 

(iv) Is there a difference in reaction time for 1 oz. and 2 oz.? Does it make 
sense to consider this question? 

9.5 Suppose a researcher comes to you with a table of data, obtained from a block 
design, that looks as follows: 
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Treatment 

1 

2 

3 

Block 1 

XXX 

XXX 

XXX 

Block 2 

XXX 

XXX 

XXX 

Block 3 

XXX 

XXX 

XXX 

Block 4 

XXX 

XXX 

XXX 


where each x represents one observation. She asks you to analyze the data. 

(i) How would you determine how to analyze the data? 

(ii) Describe two experimental plans that could have given rise to this data set. 

(iii) For each of the plans (designs) you have identified in (ii) give 

(a) an associated linear model, 

(b) the ANOVA table based on this model, 

(c) the variance of a simple treatment comparison, 

(d) the estimator for the variance in (c). 

(iv) Describe how you would do iii(b) in SAS, including the test of no treatment 
differences. 

9.6 In studying the effects of pollution, seedlings are usually exposed to specified 
pollutants for a certain period of time (say 6 hours) during the day for several 
weeks after which an evaluation is made. The seedlings are put in a pollution 
chamber which then receives the pollutant. Consider a specific case in which an 
investigator wants to compare 4 pollutants: Po = filtered air, Pi = O3, P2 = 
NO2, and P3 = O3 H- NO2 at specified levels of concentration. He has only 
limited resources. 

In particular, he has only 8 pollution chambers in each of which he can put 3 
seedlings. He feels that this is not adequate. So he decides to use the same 
chambers for 6 hours during the day and for 6 hours during the night, that is, 48 
seedlings are used for the experiment. He is sure that there will be systematic 
differences between the day and night results. 

(i) Describe an experimental plan for this experiment. What is the name for 
the experimental design that he is using? 

(ii) The investigator wonders whether certain comparisons among the pollu¬ 
tants will depend on whether one uses the results from day or night expo¬ 
sure. What kind of “effect” is he talking about and how can he investigate 
that? 

(iii) Give an appropriate linear model for analyzing data from this experiment. 

(iv) Outline the ANOVA table (source of variation, d.f., E(MS)) and indicate 
what hypotheses can be tested and how. In particular, what useful hypothe¬ 
ses about the treatment effects can be tested given the particular pollutants 
used in this experiment? 
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9.7 A researcher comes to you with data from a block design. For comparing 4 treat¬ 
ments he has used 5 blocks each with 4 experimental units. He took 2 measure¬ 
ments on each experimental unit. He comes to you with the following ANOVA 
table: 


Source 

d.f. 

SS 

Blocks 

4 

SS(B) 

Treatments 

3 

SS(T) 

Error 

32 

SS(E) 

Total 

39 



(i) What name would you give to the experimental design that he used. 

(ii) Give a linear model for the data from this design and outline the ANOVA 
(source of variation, d.f.). 

(iii) Explain why or why not your ANOVA table in (ii) agrees with the re¬ 
searcher^ ANOVA table given above. 

(iv) Give the formula for the theoretical variance for the comparison between 
two treatments using this design. 

9.8 Suppose a researcher comes to you for advice about the analysis of an experiment 
she has conducted. She has used 5 treatments and she has 9 observations for each 
treatment. She shows you the following ANOVA from a computer printout: 

Source d.f. SS F-value Pr > F 


Treatments 4 100 8.33 .0001 

Error 40 120 


The researcher wants to know from you whether the analysis is correct, and if 
so what it means. For each of the following scenarios answer the following 
questions: 

(i) What is the name of the design? 

(ii) What is the appropriate model? 

(iii) Based on the model in (ii) what is the corresponding ANOVA (include 
sources of variation, d.f” form of test for testing Hq . •丁 i 二丁 2 = … = 了 5 ). 

(iv) Is the ANOVA from the print-out above appropriate for testing Hq : ri = 

= T5? 

(v) What are the SAS statements for doing (iii) and (iv)? 
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Scenario I: 45 animals were used for the study and each treatment was 
assigned to 9 animals at random; 


Scenario II: 45 animals were used but they came from 9 different litters of 
size 5 each and each treatment was assigned to one animal in each litter; 


Scenario III: 15 animals were used, each treatment was assigned to 3 ani¬ 
mals at random, and 3 observations were obtained from each animal. 

9.9 Consider the following experimental data: 



Treatment 

1 

2 

3 

4 

5 

Block 1 

5.2 

7.3 


6.8 

10.0 

Block 2 

8.5 

9.2 

12.7 

10.6 

11,6 

Block 3 

4.1 

6.3 

9.3 

7.1 

9.8 

Block 4 

6.4 

8.9 

11,2 

5.7 

11.3 


(i) Do the exact analysis, obtaining the ANOVA table, LS means for treat¬ 
ments, and the estimated variance for simple treatment comparisons (use 
SAS PROC GLM). 

(ii) Estimate the missing value and obtain the approximate ANOVA. 

(iii) Using the analysis of covariance technique, obtain a general expression for 
the treatment LS means and the variance of differences of LS means. 

(iv) Obtain numerical values for the expressions in (iii) using the data above. 

(v) Compare the results for (i) and (iv). 

9.10 Obtain an expression for the bias of SS(T) when the estimate of the missing 
value is used as if it were the real observation. [Hint: Compare SS(T) with the 
SS obtained from the analysis of covariance.] 

9.11 Consider a RCBD with subsampling. Specifically, suppose t = 3 treatments, 
6 = 4 blocks, and n = 2 observations per experimental unit. 

(i) Give a linear model for data from such an experiment. 

(ii) Outline the ANOVA table, giving sources of variation, d.f., 五 (MS). 

(iii) Write SAS statements to carry out the ANOVA in (ii). 

Now suppose we have supplementary information in the form of a covariate 
x for each experimental unit. 

(iv) Give a linear model for analyzing data from such an experiment. 
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(v) Outline the corresponding AN OVA table, giving sources of variation and 
d.f. 

(vi) Give a SAS statement for carrying out the ANOVA in (v). 

Using the following data set 



Treatment 


1 

2 

3 

Block 1 

(5,7) 

(10, 9) 

(15, 18) 


2 

4 

5 

Block 2 j 

(10,11) 

(14, 13) 

(20, 22) 


4 

5 

3 

Block 3 

(12, 15) 

7 

(18, 20) 

3 

(24, 27) 

5 

Block 4 

(20, 18) 

(21,24) 

(35, 33) 


6 

2 

8 


(where the two numbers in parentheses represent the duplicate observations 
and the number underneath represents the covariate x) perform the two 
ANOVAs as explained in (iii) and (vi). 

9.12 A researcher wants to do an experiment using mice as the experimental units. She 
comes to you for advice on how to set up the experiment. Here is the situation: 
She wants to compare the effects of feeding three levels of calcium, say 0, 1, 
2 units, on certain bone measurements (which can be observed only after the 
mice have been sacrificed). She has available 4 litters of 6 mice each, each litter 
containing 3 females and 3 males. Each litter comes from a different breed of 
mice. 


(i) Give two experimental designs that might be suitable for this experiment. 
For each design 

(a) state any assumptions that would have to be made. 

(b) Give the associated linear model. 

(c) Sketch the ANOVA (source of variation, d.f., E(MS)). 

(d) State what inferences can be made. 

(ii) Explain how you would investigate the question that the two sexes react 
differently to the calcium treatments. 

(iii) The experimenter plans to make duplicate measurements on each animal. 
For both designs [given in (i)] give the estimators for the linear and quadratic 
effect of calcium and their standard error. 
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Latin Square Type Designs 

10.1 INTRODUCTION AND MOTIVATION 

To introduce a new and important type of blocking, using two or more blocking factors 
(intrinsic and/or nonspecific), let us consider the following example. 

Example 10.1: Suppose a manufacturer wants to investigate and compare different 
production processes for ceramic cookware. Extraneous sources of systematic variabil¬ 
ity are identified as (i) different batches of raw material and (ii) different ovens used 
for baking the product. The batches of material are obtained from different sources 
and possibly at different times. The ovens available in the manufacturing plant are of 
different makes and different ages. If the raw material were indeed uniform, we would 
us a RCBD with the ovens as blocks. And if, on the other hand, the ovens were all 
of the same type, we would use a RCBD with the batches as blocks. In the present 
situation, however, we would like to use both batches and ovens as blocks in order to 
eliminate systematic variability and hence reduce the experimental error. 

Suppose we have r batches of experimental material, Bi ， B 2 , … ， B r say，and c 
ovens Oi, O 2 ,... ， O c say. One way to proceed then perhaps might be to divide each 
batch into c equal parts and “form” blocks of the type (BiOj), that is, combining each 
batch with each oven. If we have t treatments (processes) we would then mold t pieces 
of cookware, that is, t EUs, for each block (BiOj) and then bake each piece in the 
assigned oven at the assigned process (treatment). In this whole experiment we would 
then have ret EUs and the observations would be analyzed according to the RCBD 
analysis with rc blocks and t treatments (see Section 9.6.7 for crossed blocking factors). 
□ 


The above procedure may be feasible for some experiments, but impossible for 
others since it may require too many EUs. In other cases such an arrangement may be 
physically impossible. An example of this is a field experiment where the experimental 
field exhibits a gradient in two (orthogonal) directions. This too is a situation where one 
may want to utilize simultaneously two blocking factors or, to use a common phrase, 
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block in two directions. Error-control designs other than the one described above have 
been developed for that purpose. Such designs are referred to as designs for two-way 
elimination of heterogeneity, or row-column designs. We shall refer to them as Latin 
square type designs because in many cases the Latin square design, a special form of 
row-column design developed by R. A. Fisher (1925, 1926), is the basic building block 
of such designs. 

Example 10.1 (continued): To complete the discussion of our example in the context 
of the preceding remarks we shall give appropriate error-control designs for three situ¬ 
ations: (i) t = 4, r = 4, c = 4; (ii) t = 4, r = 8, c = 4; and (iii) i = 4, r = 8, c = 5. 
These designs using rc EUs are given in Table 10.1 where the processes (treatments) 
are designated by A, B, C, D. To understand the schematic representation of these 
designs, let us look at design (i): A piece of cookware from batch 1 is produced ac¬ 
cording to process D and then baked in oven 1, a piece of cookware from batch 2 is 
produced according to process C and also baked in oven 1, and so on. □ 

How these designs were obtained will be discussed in the following sections, but the 
reader should have no difficulty recognizing that these designs have certain structures 
and combinatorial properties: In design (i) each treatment (Latin letters) occurs once 
in each row and once in each column; in design (ii) each treatment occurs once in each 
row and twice in each column (rows 1, 2,4, 7 constitute in fact design (i) and rows 3, 5, 
6 , 8 constitute design (i) with permuted columns); and design (iii) is an augmentation 
of design (ii) with column 5 being formed by columns 1 and 4 of design (i). We shall 
show in the following sections that these combinatorial structures make it possible to 
obtain estimates of treatment effect contrasts, which after all is the objective of the 
experiment. 

As in these introductory remarks we shall begin with the simplest design for two- 
way elimination of heterogeneity ， the Latin square design, which is then used as the 
building block for more complex designs involving two or more blocking factors. 


10.2 LATIN SQUARE DESIGN 

10.2.1 Definition 

The Latin square design represents, in some sense, the simplest form of a row-column 
design. It is used for comparing t treatments in t rows and t columns, where rows 
and columns represent the two blocking factors. Latin squares and their combinatorial 
properties have been attributed to Euler (1782). They were proposed as experimental 
designs by Fisher (1925, 1926), although De Palluel (1788) already utilized the idea of 
a 4 x 4 Latin square design for an agricultural experiment (see Street and Street, 1987, 
1988). 

Mathematically speaking, the Latin square of order t is an arrangement of t Latin 
letters in a square of t rows and t columns such that every Latin letter occurs once in 
each row and once in each column (see design (i) in Table 10.1)，or more generally, 
the arrangement of t symbols in at x t array such that each symbol occurs exactly 
once in each row and column. In the context of experimental design, the Latin letters 
are the treatments. Latin squares exist for every t. A reduced Latin square (or Latin 
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square in standard form) is one in which the first row and the first column are arranged 
in alphabetical order, for example, for t = 3, 


this is the only reduced Latin square. The number of squares that can be generated from 
a reduced Latin square by permutation of the rows, columns, and letters is (^!) 3 . These 
are not necessarily all different. If all rows but the first and all columns are permuted, 
we generate t\(t - 1)! different squares. From the reduced Latin square of order 3 we 
can thus generate 12 squares. 

In general, if the number of reduced squares of order t is denoted by T t and the 
total number of Latin squares of order t by U t , then U t = t\{t — l)\T t (see Denes and 
Keedwell, 1974). Below we give a list of T t for i = 2.3,.... 8 (see e.g” Denes and 
Keedwell, 1974): 


t 

T t 

2 

1 

3 

1 

4 

4 

5 

56 

6 

9,408 

7 

16,942,080 

8 

535,281,401,856 


10.2.2 Transformation Sets and Randomization 

An enumeration of all possible reduced Latin squares is facilitated through the notion 
of transformation sets which is defined as follows: One square of the transformation 
set may be obtained from the others by permutation of letters and subsequent rear¬ 
rangement into reduced or standard form. For t = 4 there exist two transformation sets 
as given below. 


c A 5 
cqcyl 
Acqc 


⑶ 


DccqA 
cADcq 
E D ^ c 
A B c Q 


⑵ 


Z) 4 RC 
CDAcq 
CQCD 4 
A cqcD 


ncBA 
CDAR 
H 4 DC 
ASCD 


1 i 


DC 4 R 
CDR .4 
5 ADC 
4 SCD 


11 

et 

s 


2 : 

et 
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In the first transformation set, square (2) can be obtained from square (1) by inter¬ 
changing A and D in (1) and rearranging, thus: 


On the other hand, m set 2 any interchange of letters and rearrangement into standard 
form leads to its reproduction. Thus the two transformation sets above account for all 
T 4 = 4 reduced squares. 

For ^ = 5 there exist two transformation sets, one containing 50 reduced squares 
and the other containing 6 reduced squares, thus accounting for all T 5 = 56 reduced 
squares. 

For t = 6 there exist 22 transformation sets which contain a total of Tq = 9,408 
reduced Latin squares. 

Fisher and Yates (1957) list examples of Latin squares for t = 4, 5,..., 12. 

The actual randomization procedure for Latin square designs was given by Yates 
(1933) and is described by Fisher and Yates (1957) as follows. The first step is to 
select a reduced square at random. For squares of order 3, 4, or 5, the second step is to 
permute all rows except the first and all columns, or all rows and all columns except the 
first, and assign treatments at random to the letters A, B, C ,.... For squares of order 
6 permute all rows and columns, and then assign the letters to treatments at random. 
For larger squares, it is satisfactory to take any square and permute rows, columns, and 
treatments. 

The randomization procedure for a 4 x 4 Latin square using SAS PROC PLAN is 
illustrated in Table 10.2, where the treatments are represented by the numbers 1 ， 2, 3, 4. 

10.2.3 Derived Linear Model 

We shall now examine the Latin square design (LSD) as an error-control design from 
the same point of view as that which we used for the RCBD. We suppose then that 
the subscripts (i, j, k) denote the row, column, and treatment of a particular EU. In 
all there are t 3 possible responses, for each treatment can conceptually be applied to 
each EU, and from this population of true yields we draw a sample which is based on 
a random t x t Latin square as described above. Such a sample has obvious properties 
of balance, particularly when we are concerned with the comparison of treatments. 

Assuming unit-treatment additivity in the strict sense and following Kempthorne 
(1952) we write the conceptual response of treatment k in row i and column j as 

Tijk = Uij + Tfc. (10.1) 

where Uij is the contribution from the EU in the zth row and jth column and Tk is the 
contribution from treatment k. We rewrite (10.1) as 

T ijk = U.. + (Ui. - U..) + {U . 3 - U.) 

+ (Uij - Ui. - tJ.j + U.) + T + (T k -f) 

=n + Pi + Jj +T k + Uij , (10.2) 


QA cqc 
CQ4cq 

R- c p A 

乂召 CD 

T 

QcqcA 
c ARQ 

ACDCQ 

T 

ACDS 

c AcqQ 

cqD^c 
DH c yl 
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Table 10.2 Randomization for 4x4 Latin Square Design 


a) Input statements: 


factors rows=4 ordered cols=4 ordered/noprint; 
treatments tmts=4 cyclic; 
output out=LS 

rowscvals=(’Bl’ ’B2’ ， B3’ ’B4’）random 
cols cvals= COr ， 02, ， 03, ’04’）random 
tmts nvals= (1 2 3 4) random; 

quit; 

proc tabulate data=LS; 
class rows cols; 
var tmts; 

table rows, cols*(tmts*f=6.) / rts=8; 

title ’RANDOMIZATION FOR 4X4 LATIN SQUARE DESIGN’ ； run; 

run; 

(b) Output: 


RANDOMIZATION FOR 4X4 LATIN SQUARE DESIGN 




cols 

_ . 



01 

02 

03 

04 


tmts 

tmts 

tmts 

… 

tmts 


Sum 

Sum 

Sum 

Sum 

rows 





B1 

1 

2 

4 

3 

B2 

3 

1 

2 

4 

B3 

2 

4 

3 

1 

B4 

4 

3 

1 

2 
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where 

is the average conceptual response, 

Pi = Ui. - u.. 

is defined as the 2 th row effect, with Epi = 0 , 

= U, - U. 

is defined as the jth column effect, with E 7 j = 0 , 

Tk~T k - T. 

is defined as the fcth treatment effect, with = 0 , and 

^ij = 一 U"i. - U.j U., 

=(Uij - U.) - (Ui. - U.) - (Uj - U.) (10.3) 

expresses the heterogeneity of the EUs in the sense that the contribution of EU (ij) is 
not made up additively of a row effect (pi) and a column effect ( 7 ^). In fact, technically 
we may refer to as a row-column interaction effect. 

We denote an actually observed response in the ith row and jth column by Zij. Let 
be the design random variable which takes the value unity if treatment k occurs on 
EU (i, j) and is zero otherwise. We can then write 

z a = Y. 5 ^ k - (io.4) 

k 

Substituting from (10.2) we obtain 

Zij = /i + Pi + 7j + S^Tk + Uij. (10.5) 

k 

Alternatively, we can express an observation on a given EU in terms of the treatment it 
received. Let Xki denote the observation from the lih application of treatment k and let 
(容 denote a random variable with = 1 if the Ith application of treatment k falls on 
EU (z, j) and = 0, otherwise. Then 

叫 1 = (10.6) 

u' 


We are mainly interested in treatment means and contrasts among them. From (10.6) 
we then have 


知 . = 

l ij 

ij 

=M + T/c + j 的 jUij 


(10.7) 
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since E / 造 = 吃 and = t. 

The joint distribution of the Sfj's is determined by the particular family of Latin 
squares from which the one actually used is chosen at random. If we use the random¬ 
ization procedure described earlier we have 

p (. s ij = !) = y 

and 

= o) = i - y. 

The Latin square structure implies that if 5^ = 1 then 5^ = 0 and 6^, = 0. This 
implies 

p (碎二 M~ = i) = o (WO 

p ( s ij = =1) = 0 (j ^ f). 

We also have 

P{Sij = l.Si， r = 1 ) 二 t ， t ]_ r ， {i + i',j ^ j'). 

Having established some properties of the we can now investigate the distri¬ 
butional properties of the models (10.5) and (10.6) or, more importantly, (10.7). Since 

Er{5%) = \ 

it follows from (10.7) immediately that 

ER{x k ) = fl + Tk + 歹 

ij 

or, using (10.3) and the fact that T，iUij = = 0 ， 

E R {x k ) = " 十 7-fc. (10.8) 

10.2.4 Estimation of Treatment Contrasts 

It follows from (10.8) that a contrast among the treatment effects, with E^c/c = 

0 , is estimated unbiasedly by the corresponding contrast among the treatment means, 
that is, 


五 i ? ( 〉 ： j — 〉二 CfcTfc . 

V fc / k 


(10.9) 
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In order to obtain vdr^TikCkXk.) we consider first 

var fl (5 fc .) = E R [x k . - E R (x k .)} 2 


Er 




Er 


L 

E ( 吃 ) H + EE 唣碎户 




善 E E 棘 w + EE 4 咯 ' 


从 -ii 飞 


i 


的 ' j¥=f 


t 2 


J2 u l + o + o + j^r[)J2Yl 叫叫 




i 科 3^3' 


using the properties of the 嗦 ’s. Now 


^i3 u vy ~ ( 二 1 

3^r 




YK ~J2Y1 u ^'j 

ij j 


= 0 ^^2 u ij + Yl u ?i + Y1 u l 

ij ij ij 

using again (10.3). We then have, substituting into (10.10), 

varfife )= ^ G + ^ i ))^^ 


Similarly，we can obtain for k’ 妾 k 


cov R (x k ,.x k '.) = ^ 


即 — 1)2 兮 4. 


Using (10.11) and (10.12) it then follows that 


var；? ^y^CfcXfc.j 二 * 


CkCk ， 


YA 


ij 

E' 


k^k' 


(t 


k %_D 2 ir ij 


( 10 . 10 ) 


u ij u ij 

i 


( 10 . 11 ) 


( 10 . 12 ) 


Y)2 S U \ 

^ _ 

(10.13) 
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If we define 

Y, u l (10 . i4) 

we can write (10.13) as 

var/? j = (10.15) 

The problem then remains to estimate g\. This is achieved again by means of the 
analysis of variance. 

10.2.5 Analysis of Variance 

To write out the ANOVA table it is convenient to write the observation on EU (i ， j) 
which has received treatment k as ytj(k) where () indicates that not all possible triplets 
(i, k) occur in the array, only t 2 out of t 3 . With this notation we then have 

Vij(-) — z ij 

Vi.(-) = 為 . 曲 ㈠ =〜 
y..(k) = Xk. 

= 足. = 无 ... 

The ANOVA as given in Table 10.3 is based on the identity for y i ： j ⑻: 

Vij(k) = y..(.) + ( 仄 .(.) 一没 ••(.)）+ ~ 安 ..(.)) 

+ (y..(k) - 没 ..(.)） + (vij(k) - Vi.(-) - y.j(-) ~ y..(k) + 2 歹 ..(.)) 

and is obtained by squaring both sides and summing over all occurring combinations 

ij(k). 

The 五 (MS) given in the left-hand column are based on the models (10.5) and (10.6) 
as established earlier and follow from randomization theory. The results show that the 
LSD is an unbiased design in Yates’ sense and that MS(_E) is an estimate of 

To test the hypothesis i^o ： r i = r 2 = • ■ • = ^ = 0 we are led by the 丑 (MS) in 
Table 10.3 to consider the ratio 

尸二 MS(T) 

_ MS{E) 

This ratio F will be evaluated for the square actually used and for all other squares 
that we could have obtained by the randomization procedure. If the value for the square 
actually used is equaled or exceeded by that of P percent of the possible arrangements 
(including the one used), we shall say that we have significance at the P percent level. 
The evaluation of the significance in a particular experiment could be somewhat labori¬ 
ous, and we rely on the fact that, similar to the result for the RCBD (see Section 9.2.5 )， 
the distribution of the criterion F will be closely approximated by the F-distribution 
with t — 1 and (t — l)(t — 2) d.f. 



10.2. LATIN SQUARE DESIGN 


383 


s 

Add. in broad sense 

r— ! 

1 

■1^ 

w- 

+ 

« a 
fc 

+ 

M ? 

b 

1 

w- 

+ 

b 

+ 

b 

1 

w- 

+ 

M S 

b 

+ 

b 

+ 

!N p- 

b 

CN S 

b 
+ 
n a 
b 

十 

t 



u 

1 

ts 

1 

.£ 

< 

1 

w- 

rH 

1 

w.~ 

1 

w- 

+ 

!N^» 

cy ： 



S 

1 

VO 

C/2 

II 

II 

1 

•W 

\ 

vT 

C/5 

II 

cn 1 

1 

1 

•+ *i 

C/5 

II 






^Ti 

s 



00 

xn 

S' 

It 

'?> 

1 

w- 

■w 

t 

II 

155 

1 

:？ 5 

w- 

00 

oo 

II 

CN_^ 

1 

w- 

•»c 

By subtraction = SS(E) 

CN 

1 

.，> 

wf 



— 

1 

i 

1 

7 

1 

T—( 

1 


V 

3 

1 

CC 

i 

_D 

u 

1 

1 

n 


asi JC2 VAOZV rolusex 



384 


CHAPTER 10. LATIN SQUARE TYPE DESIGNS 


The extent to which the distribution of the criterion F over the possible randomiza¬ 
tions may be represented by the F-distribution has been examined by Welch (1937). 
He showed that the quantity 


SS(T) 

ss(r) + ss ㈤ 


has a mean value of l/(t - 1), and this is the mean value of the beta distribution which 
is a transform of the F-distribution. This was obtained with only the assumption that 

for i ^ i\j j f (see Section 10.2.3) and therefore holds for any transformation set. 
Welch (1937) also found that var(t/) does depend on the transformation set, and also 
on the quantities 


G = 难 

h = ^ 


D = \ Y, u l 

\ 

Efe 


3 \ i 


where u-ij is as defined in (10.3). He examined var(J7) for some constructed data 
and for some sets of uniformity data and found the proportion of times the 5 percent 
value of U from the beta distribution was exceeded ranged from 2.7 to 6.2 percent. 
The approximation by the F-distribution is therefore not entirely satisfactory, but the 
evidence is not conclusive, in that the approximation depends on the quantities C, D, 
G, H above, and, in a particular case, we do not know these values, nor do we know 
the values we shall meet in practice. The rules given above for the choice of a random 
Latin square are designed to give equal probability to all possible Latin squares of size 
less than 7x7, and this appears to be a desirable procedure. To conclude this aspect of 
the Latin square, we shall assume that normal theory gives satisfactory approximations 
to corresponding randomization tests, but some care needs to be taken with small Latin 
squares as noted below. 

We consider first the 2 x 2 Latin square, for which there are only 2 different ones: 
namely, 

A B J B A 

BA and A B 

This square has no degrees of freedom for error, as is obvious from the fact that, if we 
use one square and obtain the treatment difference, then the treatment difference given 
by the other square is the negative of the difference with the former square. If then we 
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wish to compare 2 treatments with 2x2 Latin squares, we must use many squares，and 
to make any test we must assume that the difference is constant from square to square 
(see Section 10.3). The randomization test is so simply performed in this case, that 
with a small number of squares, say 6 or less，it would probably be advisable to rely on 
the randomization test procedures rather than the usual F-distribution approximation. 

In the case of the 3 x 3 Latin square, it is important to note that there are, in fact, 
only 2 different partitions of the 9 cells into 3 sets of 3, in such a way that each set is 
represented in each row and in each column. There are 12 different 3x3 Latin squares, 
but these give the same partitioning in sets of 3. If we wish to test the null hypothesis 
that there are no differences among the 3 treatments, we shall use the ratio of treatment 
mean square to error mean square as the test criterion. If this takes the value R with 
the randomization we in fact used, it will take the same value for 5 of the other 11 
randomizations, and the value l/R for the remaining 6. This happens because the sum 
of the treatment sum of squares with 2 degrees of freedom and the error sum of squares 
with 2 degrees of freedom is constant for all randomizations, and a randomization that 
gives a partitioning different from the one actually used will have the treatment and 
error sum of squares interchanged. We are, therefore, in the position of not being 
able to make a significance test, for which the chance of rejecting the hypothesis when 
true is less than 50 percent (or, in other words, of size less than .50). This fact is 
important because, if we use the normal theory model in which the errors in the model 
are assumed to be normally and independently distributed with mean zero and constant 
variance cr 2 , we shall use the F-test with 2 and 2 degrees of freedom and can make a 
test at any significance level we please. The distinction we make throughout this book 
between the derived and normal theory model is therefore extremely relevant. If we 
consider a particular treatment contrast, and evaluate it for the 12 possible 3x3 Latin 
squares，we shall find that there are 6 possible values which the criterion (mean square 
due to treatment comparison/error mean square) can take. We therefore only make a 
test of significance with level 1-in-6, of the hypothesis that the true comparison is zero, 
if we use a 2-tailed test. For these reasons, a single 3x3 Latin square experiment is 
virtually valueless, and, if we use a small number of replications (see Section 10.3 )， 
we should, as with the 2 x 2 square, probably use the randomization test procedures, 
although it is often found that the usual F-test gives a remarkably similar answer. 

There are in all 4(4!3!) or 576 different 4x4 Latin squares, but these lead to only 
24 different partitions of the 16 cells into 4 sets of 4, in such a way that each cell is 
represented in each row and in each column. It is therefore desirable to make the test 
strictly according to the randomization test procedure. 

Squares of side 5 and 6 were examined by Welch (1937). For squares of side 7 or 
more it seems reasonable to assume that the F-distribution is satisfactory. 

10.2.6 The Model under Additivity in the Broad Sense 

So far we have considered only the situation where unit-treatment additivity in the 
strict sense holds. To do so is useful to bring out some of the essential features of 
a LSD. From a practical point of view, however, the inclusion of technical errors is 
important, which leads us to the situation where unit-treatment additivity in the broad 
sense holds. We shall not go into all the details here but rather refer the reader to 
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our discussion of this topic in connection with the CRD (Chapter 6) and the RCBD 
(Chapter 9). The important point to remember here is that to the unit error, Uij, we 
now add another component of experimental error, 〜 ⑻， and the observational error, 
rjij(k)^ both with means zero and variances al and cr^, respectively. The E(MS) in 
terms of these variance components are given in the right-hand column of Table 10.3. 

As we have argued earlier (Chapter 6 and 9), for purposes of making inference 
about the treatments, it is convenient to define the experimental error as y ⑻ and the 
observational error as rj ^ ⑻， which can be considered as i.i.d. random variables with 
means zero and variances = cr^ + al and <7^, respectively. Going one step further 
we define the total error as which can be considered as i.i.d. random variables 
with mean zero and variance g\ = cr, + 0 ^. We then write (following (10.5)) the model 
for an observation on EU (ij) to which treatment k was applied as 

Vij(k) = " + Pi + + r fc + e ij(k)' (10.16) 

Its properties follow from our discussion above. In particular we obtain, as suggested 
by (10.15), that the treatment contrast Ec^r^ is estimated as Ecky.,(k) with 

var ^ c fc f fc j = var ^c fc y.. (fe) j = 宁 . 

As is obvious from Table 10.3, MS ( 五 ） is an estimate of and Ho •. tl = 丁 2 = … = 
r t can be tested as before by considering F - MS(T)/MS(E) as an F-statistic with 
t - 1 and (t — 1 )( 艺 _ 2) d.f. 

We point out that similar to our findings for the RCBD, the form of the E(MS) in 
Table 10.3 indicates that there do not exist legitimate tests for row and column effects. 
For an assessment of the effectiveness of blocking by rows or columns we refer to 
Section 10.2.9(iii). 

10.2.7 Consequences of Nonadditivity 

Just as in the case of the RCBD the assumption of unit-treatment additivity may not 
always hold. Wilk and Kempthome (1957) discussed the LSD in its most general form. 
They considered the situation where the t rows are sampled from a population of R 
rows, the t columns are sampled from a population of C columns, and the t treatments 
are sampled from a population of T treatments. They also amended model (10.16) to 
include row x treatment, column x treatment, and row x column x treatment interactions. 
If < 7 ^, G 2 ci , and a^ ct denote the variance components due to these interactions (for a 
precise definition we refer the reader to Wilk and Kempthorne, 1957), then the relevant 
•E(MS) from the ANOVA table can be written as follows: 

五 [MS(r)] =0^+(0+ Wet + R R t(J rt + ~^~ a ct J ^ t ^ 2 r k/( T - !) 

(10,17) 

•E[MS(E)] = CTg + (pCTrct + a rt + ^ct', (10.18) 
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where 

丄-丄」） 

R C T). 

For the special case R = C = T = t, which we have considered here, (10.17) and 
(10.18) reduce to 

E[MS(T)} =a 2 e +(l-^)a 2 rct + tJ2 4 / (卜 1) (10.19) 

and 

£ 畔 ) 卜心 ㈠ 仏 Wt . (10.20) 

The results (10.17) and (10.18) suggest that if /? 》 t and C 》 t (corresponding 
essentially to the situation of random row and column effects) then the usual F-test 
suggested above is still appropriate even in the presence of interactions. This is, how¬ 
ever, not true for the fixed effects situation as illustrated by comparing (10.19) and 
(10.20). In this case MS ( 五 ） is on the average larger than MS(T) under the hypothesis 
of no treatment effects and hence the usual F-test will lead to fewer significant results. 
In this case the LSD is not an unbiased design. For more details the reader is referred 
to Wilk and Kempthome (1957) and an interesting somewhat different discussion by 
Neyman et al. (1935). Another objection to the assumption of additivity is provided by 
Srivastava and Wang (1998). 

10.2.8 Investigating Nonadditivity 

The problem of unit-treatment interaction, mainly in the form of row-treatment and/or 
column-treatment interaction, is obviously an important one. There is, however, no 
easy method of detecting such interactions in a LSD. A partial solution has been sug¬ 
gested by Tukey (1955). His method was reformulated in terms of an analysis of co- 
variance by Rojas (1973) using an interaction model of the form 

Vij(k) = n + Pi + +Tk+ 0{piT k + ^fjTk + Pi7i) + e ij(k)' (10.21) 

Writing (10.21) alternatively as 

Vij(k) = /i + Pi + Jj + Tfc + + (10.22) 

and choosing 

x ij(k) = {pi "I" Tj + 子 k) 2 

= 2(piT k + ^/jfk + pnj) + {pi + A /j + Tfc) (10.23) 

Rojas showed that, as a generalization of the method described in Section 9.6, testing 
for Hq : 0 = 0 with model (10.22) is the same as Tukey’s one-degree-of-freedom test 
for nonadditivity in the LSD. 

Model (10.21) is obviously only one of several ways in which interactions in the 
LSD can be characterized. Rather than including all two-factor interaction terms as in 
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model (10.21) it may be more useful to (i) include only treatment-row and treatment- 
column interactions and (ii) include them as separate terms in an analysis of covariance 
model, thus simply extending Scheff6’s (1959) derivation of Tukey’s (1949) test (see 
Section 9.6). We propose to model the interactions (pr)ik and ( 7 r)jfe as (pr)ik = PiTk 
and = 7 jT^ and consider the model 


Vij{k) 

—M + ^ + 7j + r fe + ^l x ij(k) + + e ij(k) 

(10.24) 

with 

x ij(k) = Pi^k = 

~ (仄 .(.） 一 y..{-))[y..{k) - V••(.、) 


for all j, and 

z ij(k) = - 

= (h) — y..^)){y..{k) - 



for all i. Using the fact that = 元 ■•(&)= 0, 芝 .j(.) = — 0 and applying the 

method described in Section 8.7 (modified for the LSD), we obtain immediately 


A 


E'xy 

Exx 


ijk _ 


ik 


Etei.(-) - y..(-)) 2 E (仏 .⑻- y..(.)) 2 

i k 


and 


^ z ij{k)Vij{k) ~ y..(■))(y..(k) ~ y..{-))y.j{k) 

i^zy _ _ — fk _ 

E zz ~ E 4 (fc) — Sd(.) — y ..(.)) 2 E(y..(k) - y..c )) 2 

ijk 3 k 


(we have used the fact here that E xz = 0). We then obtain in the usual way 
SS(^) = and SS(0 2 ) = 

^XX L zz 

To test Hoi ： — 0 (i = 1,2) we use 

F = _ sm) _ 

1 [SS(£ , ) — SS(0i) — SS(02)]/[(f — l)(t — 2) — 2] 

as an F-statistic with 1 and (t - l)(t -2) — 2 d.f” where SS(E) is obtained from 
Table 10.3. 

As an alternative to model (10.24) we may, of course, only want to include one 
interaction term, depending on our knowledge of the experimental situation at hand. 
The modification to the F-test given above is obvious. Whichever model we consider, 
however, if interaction is indicated there does not seem to be an easy solution to a 
meaningful analysis of the data. Search for a suitable transformation to additivity may 
be an option, but it may not be easily achievable. 
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10.2.9 Miscellaneous Remarks 

We conclude this section with a few remarks, concerning topics that were discussed in 
great detail in earlier chapters: 


(i) Analysis of covariance. In addition to the blocking by rows and columns sup¬ 
plementary information may be available on the EUs and further reduction in 
experimental error variance may be achieved by using analysis of covariance. 
The formal procedure is similar to that discussed in Section 9.4 with the obvious 
and by now familiar modifications. 


(ii) Missing observations. The method proposed by Coons (1957) as discussed in 


Section 9.5 can be applied here also. The formula for a missing value in row i*, 
column j*, and treatment k\ corresponding to the development in Section 9.5.1 ， 
is now given by 


tR{* -|- + tTk* — 2G 


(10.25) 


using obvious notation. Formulae for several missing values can be obtained 
using several covariates and the methods of Section 8.7. Explicit expressions are 
given by Kramer and Glass (1960). 


(iii) Relative efficiency. It may be of interest to ask whether blocking in two directions 
has been useful compared to blocking in only one direction, either by rows or 
by columns, that is, using a RCBD with either rows as blocks or columns as 
blocks. Using the concept of a uniformity trial (see Section 9.3) and the resulting 
partition in the ANOVA，or randomization analysis, 

Source d.f. 


Rows 

Columns 

Residual 


t - 1 

(t-i ) 2 


we obtain the following EREs: 


(a) Rows used as blocks 

ERE (LSDtoRCBD Row J 

(b) Columns used as blocks 

ERE(LSDtoRCBD Columns 


MS(C) + {t- 1)MS ㈤ 
tMS{E) 


MS(i?)-f (f-l)MS(E) 
卜 tMS{E) 


(10.26) 


(10.27) 


where MS(i?), MS((7)，and MS(£-) are obtained from the ANOVA (see Ta¬ 
ble 10.3) of the completed experiment using an LSD. For another connection 
between the EREs and the ANOVA table see Lentner, Arnold, and Hinkelmann 
(1989). 
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It is sometimes argued that the LSD is not very useful from a practical point of view 
because of two limitations: (i) The numbers of rows，columns and treatments have to 
be the same, and (ii) for small values of t one has insufficient d.f. for error. We do 
not necessarily agree with such a strong viewpoint, but even critics agree that what we 
might call the Latin square principle, namely orthogonal blocking in two directions, 
is extremely important in the whole endeavor of experimental design. It gives rise 
to many more error-reduction designs as well as treatment designs which are of great 
practical value. Some such designs will be discussed in the next sections (see also 
Section II.6.6). 

10.3 REPLICATED LATIN SQUARES 

10.3.1 Different Scenarios for Replication 

One way to increase the error d.f. is obviously to replicate a LSD. How such replica¬ 
tions are carried out depends on the particular experimental situation. We shall use the 
example of Section 10.1 and illustrate, in accordance with our discussion in Chapter 2 
and Section 9.6.7, how different methods of replication lead to different linear models 
and hence to different analyses. 

Referring to Example 10.1, consider the following situations: 

Example 10.2: The basic experiment, using a LSD of order t, is replicated by the 
manufacturer r times as follows. Each of the t batches of experimental material is 
divided into r parts to be used in the r replications, respectively. The same t ovens are 
used in each replication. □ 


Example 10,3: Different batches of experimental material are obtained for each 
replication, t batches for each replication. The same t ovens are used in each repli¬ 
cation. □ 


Example 10.4: Rather than having the experiment replicated by one manufacturer, 
we may ask r different manufacturers to carry out the basic experiment. Each manu¬ 
facturer has his own suppliers of raw material and different ovens are available in the r 
different factories. □ 

In each of these experiments the randomization procedure, as described in Sec¬ 
tion 10.2, is carried out independently for each component Latin square. The major 
difference between the situations described above is whether, in classification termi¬ 
nology, the various blocking factors are crossed with each other or nested within each 
other (see Section 4.12). We shall discuss this now and show how that affects the 
analysis. 
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Table 10.4 ANOVA for Model (10.28) 


Source 


d.f. 

SS 


五 (MS) 

Replications 

r — 1 


i 

-S...0)) 2 


Batches 

t - 1 



- y...(-)) 2 


Ovens 

t - 1 


rtJ2(y..k(-) - 

k 

- y -c-)) 2 


Treatments 

t - 1 


- 

• y...(-)) 2 

+ - 1) 
l 

Error 

r(t - l)(t - 

2) + 3(r- l)(t- 1) 

Difference 



Total 

rt 2 - 1 


JZ ivijk(i) 

ijk(l) 

— y...(-)) 2 



10.3.2 Rows and Columns Crossed with 
Replications 

In Example 10.2 the factor “batches” and the factor “ovens” are crossed with the factor 
“replications” since the batches and ovens are the same in each replication. We assume 
that there are systematic differences between replications simply because of the time 
lag. As an extension of model (10.16) an appropriate model for observations from this 
design is then 

Vijk{i) = M + ft + + ^ + eijk(i )} (10.28) 

where pj，ui are as defined before and ai(i = l, 2 ， ... ， r) represents the effect of 
the 2 th replicate. This model leads to the ANOVA of Table 10.4. 

We should point out that (10.28) is only one possible model. If desired and war¬ 
ranted this experimental setup allows us to separate out from the SS(Error) a sum of 
squares due to replication x treatment interaction with (r — l)(t — 1) d.f. In fact, tech¬ 
nically, the d.f. for error as given in Table 10.4 are the sum of the error d.f. for the r 
individual Latin squares and the d.f. for the replication x row, replication x column, 
and replication x treatment interactions. 

10.3.3 Rows Nested in and Columns Crossed with 
Replications 

Since in Example 10.3 new batches of material are obtained for each replication, 
the factor “batches” is nested within the factor “replications.” As before, the factors 
“ovens” and “replications” are crossed. This leads to a model of the form 

Vijk{i) = ^ o>i pij -|- + eijk(i )， (10.29) 

where the pij (i = 1, 2 ,... ,r; j = 1 , 2 ,..., t) are now the row effects nested within 
replications. The ANOVA associated with model (10.29) is given in Table 10.5. Just as 
before, model (10.29) can be amended to include replication x treatment interaction. 
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Table 10.5 ANOVA for Model (10.29) 


Source 

d.f. 

SS £J(MS) 

Replications 

r — 1 


Batches/Reps 

r(t - 1) 

t S(yij.(-) - Vi ..(-)) 2 
i-.j 

Ovens 

t - 1 

k 

Treatments 

t - 1 

^E(y...(i) - y...( >) 2 4 - l) 

i i 

Error 

r(t-l)(t-2) + 2(r-l)(i- 1) 

Difference 

Total 

rt 2 — 1 

H (Vijkii) - y. ..(■)) 



ijk(l) 


10.3.4 Rows and Columns Nested in Replications 

Since in Example 10.4 different manufactures are involved，the batches of material 
and the ovens will be different from one manufacturer (that is, replication) to the next. 
Hence the factors “batches” and “ovens” are nested within the factor “replications,” An 
appropriate model then is 

Uijk(i) = Pij 4- + ^ + e《j 友⑴， （ 10.30) 

which leads to the ANOVA given in Table 10.6. In this case it may very well be useful 
and advisable to amend model (10.30) and include a manufacturer x treatment interac¬ 
tion term if one wants to investigate whether differences among treatment effects are 
manufacturer specific due to different production processes employed by the different 
manufacturers. 

In all three cases the hypothesis n = T 2 = • • • = = 0 is tested in the usual way 

by using F = MS(7 1 ) /MS(E) with t — 1 and v d.f., where MS(E) and v are computed 
differently for the three situations. 

10.3.5 Replication x Treatment Interaction 

We have given as a rationale for replicating a basic LSD our desire to increase the 
number of d.f. for the error SS in order to increase the power of the F -test for treatment 
effects. But during our discussion in Sections 10.3.2, 10.3.3 ， 10.3.4 we have already 
alluded to the fact that such replication may enable us to investigate replication x 
treatment interaction. In fact, this may very well be the major reason for replicating the 
LSD, that is, including another blocking factor, most likely an intrinsic factor, in order 
to inquire whether the performance of the treatments is the same for the different levels 
of that intrinsic factor. For this purpose we need to modify the models given above. 
Specifically, model (10.28) changes to 

Vijk{l) — + Oii + pj + 7a ： + D + [olt)h H- (10.31) 
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Table 10.6 ANOVA for Model (10.30) 


Source 


d.f. 

ss 

五 (MS) 

Replications 

r — 1 


t 2 E(Pi..(-) -y...(-)) 2 

i 


Batches/Reps 

r(t 一 1) 


t HiVij-i-) ~ Vi..(-)) 2 


Ovens/Reps 

r (卜 1) 


ik 


Treatments 

t - 1 


rt Sd. ⑴— y...()) 2 

i 

d + wE T z 2 / (卜 0 

l 

Error 

r(t - l)(t 

-2) + (r--l)(i-l) 

Difference 


Total 

rt 2 - 1 


S (vijk(i) - y...{ )) 2 



model 10.29 becomes 

Vijk(i) = /U + ai + p u + +TI + {aT)u + (10.32) 

and model 10.30 becomes 

Vijk(i) = ^ + OLi~\~ Pij + 7ifc + 77 + {o^r)n + (10.33) 

For each case we need to amend the ANOVA tables 10.4, 10.5, and 10.6 by the inter¬ 
action sum of squares (with A denoting the replication factor) 

ss(4 x r) = tJ2(yi..(i)-yi- - v...{D + y -) 2 
il 

with (r 一 l)(t 一 1) d.f. As a consequence, the error d.f. will be changed to r(t—l)(t—2)+2(r 
r(t—l)(t—2)+(r—l)(t—1)，and r(t—l)(t—2), respectively. 

To test for interaction we use the F-test 

MS{A x T) 

= MS ⑹ 

with the appropriate d.f. as given above. To deal with possible interaction we use the 
same approach as outlined in Section 9.6.8. 

10.4 LATIN RECTANGLES 

Another method of increasing the error d.f. but still maintaining the Latin square prin¬ 
ciple is to use a design with rt rows and t columns (or t rows and rt columns as rows 
and columns are obviously interchangeable). In our example, we may have rt batches 
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of material available at one time to carry out the experiment, rather than t batches at 
r different times. An appropriate error-reduction design is then obtained by “inter¬ 
mixing” r Latin squares generated by r independent randomizations. Example (ii) in 
Table 10.1 illustrates such a design. Because of its obvious geometrical appearance and 
its properties we refer to such a design as a Latin rectangle. Obviously each treatment 
occurs exactly once in each row and r times in each column. 

An appropriate linear model for this design is 


Vij{k) == M + pi + + Tfc + e ij(k)^ (10.34) 

that is, the same as for the LSD except that now i = 1,2,..., rt. The ANOVA as¬ 
sociated with model (10.34) is given in Table 10.7. Inspection shows that this is very 
similar to the ANOVA in Table 10.4. Since there are no replications in the sense of 
Section 10.3 we, of course, cannot obtain a replications x treatment sum of squares. 


10.5 INCOMPLETE LATIN SQUARES 

As we have mentioned earlier, one disadvantage of a LSD is that the numbers of rows, 
columns, and treatments must be the same. This is especially true if the number of 
treatments is large, since then the heterogeneity of the EUs in the square array may be 
appreciable as measured by the in (10.3). There exist ， however, designs with r = t 
rows and c(< t) columns which combine the Latin square property of eliminating 
heterogeneity in two directions with the property of a BIBD of comparing treatments 
with the same variance. Such designs are referred to as Youden squares since they 
were introduced by Youden (1937) after Yates (1936) considered the special case of 
c = t — 1. 

Example 10.5: A Youden square for t = r = 7, c = 3 is given below (with the 
treatments numbered 1, 2,..., 7): 


Column 


Row 

1 

2 

3 

1 

1 

2 

4 

2 

2 

3 

5 

3 

3 

4 

6 

4 

4 

5 

7 

5 

5 

6 

1 

6 

6 

7 

2 

7 

7 

1 

3 


When using this design we would, of course, randomly assign the treatments to the 
numbers 1,2,..., 7 and then randomize the rows and columns. It is not difficult to 
verify that this design with rows as blocks is indeed a special arrangement of the BIBD 
(7,7, 3,3; 1). □ 
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Table 10.7 ANOVA for Model (10.34) 


Source 

d.f. 

SS 五 (MS) 

Rows 

rt — 1 


Columns 

t - 1 

rt Yl(y.jn - 

Treatments 

t - 1 

rtYl(y..(k) - V..{-)) 2 + rt^2r^/(t - 1) 

k k 

Error 

r(t-l)(t-2) + 2(r-l)(t-l) 

Difference 

Total 

rt 2 -l 

Y2 (Vij(k) - y..(-)) 2 



0 •⑻ 


More generally, Hartley and Smith (1948) have shown that for all BIBDs with 
t = b such arrangements exist. A listing of these is given in Cochran and Cox (1957). 
It should be clear from the description of these designs that the columns are orthogonal 
to the rows and treatments, but the rows are not orthogonal to the treatments since not 
every treatment occurs in every row. This implies that, using model (10.16) for the 
analysis, the estimates of the treatment effects have to be adjusted for row effects, that 
is, no longer can treatment means be used to estimate treatment effects but LS means 
must be obtained (see Chapter II.2 and Section II.6.5). 

Youden squares can also be used to generate designs with c > t using a method 
similar to that of constructing extended block designs (see Section 9.8). We simply 
adjoin to a Latin square (or multiples of Latin squares) a Youden square. Example (iii) 
in Table 10.1 provides a trivial application of this idea. We may refer to such designs 
as extended Latin square designs. 

Finally, the Latin square idea can be modified to include designs with r = at 
(a integer), c < t such that the BIBD property holds with rows as blocks and each 
treatment occurs a times in each column. We call these designs extended incomplete 
Latin squares. A listing of such designs is provided by Cochran and Cox (1957). 


10.6 ORTHOGONAL LATIN SQUARES 

10.6.1 Graeco-Latin Squares 

An interesting and sometimes useful generalization of the LSD is obtained by con¬ 
sidering elimination of heterogeneity in more than two directions. For elimination of 
systematic variation in three directions consider the following example. 

Example 10.6: We want to compare the “usefulness” of four different word pro¬ 
cessing softwares (A, B, C, D) using four different PCs, four secretaries, and four 
different texts (a, 0, 7 ,5). We want to eliminate differences among PCs, secretaries, 
and types of text. A suitable arrangement may be as follows: 
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Secretary 

PC 

1 

2 

3 

4 

1 

Aa 

B 7 

C6 

D0 

2 

B3 

AS 


Col 

3 

Cl 

Da 

Ad 

B5 

4 

DS 

C3 

Ba 

Aj 


that is, secretary 1 types text a on PC 1 with word processor A, and so forth. □ 

This design has interesting combinatorial properties: if we ignore the Greek letters 
we have a Latin square; if we ignore the Latin letters we also have a Latin square; 
in addition, each Greek letter occurs exactly once with each Latin letter. We have 
superimposed two Latin squares on each other with the resulting third property. We 
refer to such an arrangement as two orthogonal Latin squares, or more specifically, as 
a Grceco-Latin square (the name is suggested by the use of Greek and Latin letters). 
Such designs exist for all t except t = 6, Grsco-Latin squares for t = 3, 4, 5, 7, 
8 , 9, 11, 12 are given by Cochran and Cox (1957), for t = 10 by Bose, Shrikhande 
and Parker (1960), and Fisher and Yates (1957) list complete sets of orthogonal Latin 
squares for t = 3, 4, 5, 7, 8 , 9. Pairs from these sets can be used to obtain different 
Greeco-Latin squares (superimposing three orthogonal Latin squares will yield a design 
suitable to eliminate heterogeneity in four directions, and so on). 

A model for analyzing data from a Grseco-Latin square design is an obvious exten¬ 
sion of (10.16), that is, 

Vij{kl) = + Pi ~r ^/j ~r Sk +ri -e i： j ⑽ (10.35) 

where pi are the row effects, 7 j are the column effects, represent the effects of 
the blocking factor associated with the Greek letters, and 77 are the treatment effects 
k ) l = 1.2,..., t). The fact that out of all possible t 4 combinations (z, j, fc, 0 only 
t 2 occur in a Graeco-Latin square is indicated in the subscript notation ij(kl) for the 
observations in model (10.35). As a consequence, after accounting for t — 1 d.f. each 
for rows, columns, Greek letters, and treatments in the ANOVA table (see Table 10.8) 
only (t - 1)(^ — 3) d.f. remain for error. For small t this is usually not sufficient, but 
matters may be improved through appropriate replication. Yet, the Graeco-Latin square 
suffers from the same (and more) restrictions than the LSD and that may impede its 
usefulness in practical applications somewhat. 


10.6.2 Mutually Orthogonal Latin Squares 

The process of superimposing orthogonal Latin squares, and thereby creating error- 
control designs to eliminate additional sources of variability, can be continued for most 
values of t. For example, when t is a prime number or a power of a prime number then 
there exists at x t square with each cell containing a letter of t — 1 languages，such that 
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Table 10.8 ANOVA Table for Graeco-Latin Square 


Source 

d.f. 

SS 

E(MS) 

Rows 

t 一 1 

t ZX 仏 .(..）一 y ..(..)) 2 


Columns 

t-1 

% 


Greek Letters 

t-l 



Treatments 

t-1 

^ ZXy..(.z) - y..{..)) 2 

^ + T i/{^ - !) 

Error 


Difference 

a 2 e 

Total 

t 2 ~i 

Yj (Uij(ki) — 
ijkl 



the letters of any two languages form a square with the Graeco-Latin square property 
(Bose ， 1938; Stevens, 1938). Consider the following example. 

Example 10.7: For t = 4 with the t ~ 1 = 3 languages being Latin letters，Greek 
letters, and numerals, an arrangement of three orthogonalized squares is given by 


Aal 

B02 

Cj3 

D54 


A63 

Da2 

C81 

CS2 

Dyl 

AM 

Ba3 

D03 

Ca4 

BS1 

Aj2 


Such squares are referred to as completely orthogonalized squares. In general the, 
say, k ^t — l superimposed Latin squares with the property described above are called 
mutually orthogonal Latin squares (MOLS). As error-control designs they can be used 
to eliminate then fe + 2 sources of variation. 

10.7 CHANGE-OVER DESIGNS 

The structure of a Latin square forms the basis for a variety of error-control designs. 
In this section we shall discuss briefly such a situation where individuals (subjects) 
are used as one blocking factor and time period as the other blocking factor. These de¬ 
signs have been used extensively in different kinds of experimental settings, but mainly 
in the pharmaceutical industry during the testing of new drugs, in animal science for 
feeding experiments, and in psychological studies. The basic idea is that each individ¬ 
ual receives (sequentially) all or some of the treatments, one at any given time period, 
and that for different individuals the order of the application of the treatments is being 
changed. And even though the designs for different applications have the same fea¬ 
tures, they are often referred to by different names, such as cross-over designs ， change¬ 
over designs, carry-over designs, switch-over designs ， counter-balanced designs, and 
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sometimes more generally and generically by repeated measurement designs (see also 
Chapter 14). Also, because there often exists considerable variability among subjects 
the fact that each subject is exposed to every treatment is described as “each subject 
being its own control.” 

10.7.1 Two-Treatment Change-Over Design 

In its simplest form a change-over design consists of n = 2r subjects to compare two 
treatments, A and B say, over two time periods. In this situation r subjects receive the 
treatments in the sequence A — B, that is, treatment A in period 1 and treatment B 
in period 2. The remaining r subjects receive the treatments in sequence B — A, that 
is, in reverse order. With periods as “rows” and subjects as “columns” this design has 
obviously the form of a Latin rectangle (see Section 10.4), intermixing r2 x 2 Latin 
squares. 

The usefulness of this design depends on whether: 

(i) we assume that because of a suitable “wash-out period” between periods 1 and 2 
the observations in period 2 are affected only by the treatment applied in period 
2 , that is, the treatment applied in period 1 has no effect on the outcome, or 

(ii) there is a carry-over effect from period 1 to period 2 , that is, the treatment applied 
in period 1 has an effect on the observation in period 2, in addition to the effect 
of the treatment applied in period 2 . 

An appropriate model for situation (i) is 

Vij(k) = M + Pi + Sj + + ey ⑻， （ 10.36) 

where pi is the effect of the ith period and Sj is the effect of the jth subject with 
i = 1, 2; j — 1, 2, .... 2r. Model (10.36) is, of course, the same as model (10.34). 

For situation (ii) model (10.36) has to be amended for observations in period 2 
to account for the carry-over or residual effects. Suppose that subject 1 receives the 
treatments in the sequence A — B, and subject 2 receives the treatments in the sequence 
B — A. Then the observations y\\A>,V 2 \B^V\ c iB^V 22 A can be modelled as follows: 

VllA = " + Pi + + T/i + eiiA 

U21B = " + 灼 + + 7 乂 + €-21B (10.37) 

I/12B = /i• + pi + 52 + + ei2B 

P22A = " + P2 十 & +『A + + e 22A) (10.38) 

where ja and jb represent the residual effects for treatments A and B, respectively. 
To make the distinction clear, ta and tb are also referred to as the direct effects of 
treatments A and B, respectively. More generally we may write models 10.37 and 
10.38 as 

Vij(kl) = !^ -\- Pi + Sj + Tk -r 7 ⑴ + e ij(kl )， (10.39) 

where Tfc refers to the direct effect of the treatment assigned to subject j in period i t 
and 7 ⑴ represents the residual or carry-over effect of the treatment assigned to subject 
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j in period i — 1 , with 7 (/) = 0 for i = 1 , that is, there is no carry-over effect for 
observations in period 1 . 

Returning to our basic design which is essentially of the form 



Sequence 

Period 

1 2 

1 

A B 

2 

B A 


with the two sequences repeated r times. For purposes of parameter estimation it is 
sufficient to simply consider the case r* = 1. We thus have four observations, that is, 
three degrees of freedom. We know already that under (i) above and model (10.36) we 
attribute one d.f. each to periods, subjects (sequences)，and treatments (see Table 10.8). 
Repeating the sequences merely provides error d.f. (see Table 10.8). 

Considering now situation (ii) above and model (10.39) we recognize immediately 
that the suggested design does not provide enough d.f. to estimate unbiasedly con¬ 
trasts for all four sets of parameters. Thus, another type of design is required. One 
such design with two periods was proposed by Balaam (1968), although he originally 
considered situation (i) above with the possibility of including period x treatment inter¬ 
action. His design consists of r repeats of the following sequences: 



Sequence 

Period 

1 

2 

3 

4 

1 

A 

B 

A 

B 

2 

B 

A 

A 

B 


Using model (10.39) the expected values of the eight means from this design can 
be written as 


Mil A 

= 

fM+Pl+Sx+TA 

M21 B 


M + P 2 + 4- rs + 7 a 

"12 B 

= 

/j,Pi + s 2 r B 

^22 A 

= 

〆 + P 2 + 幻 + 十 7s 



+ S 3 +TA 


= 

/U + P2 + 53 + ta + 7A 

1^14 B 

= 

// + Pi + «s 4 + 

M24B 

= 

M + P2 + 54 + Ts + 7s 


Contrasts among these fiijk can then be used to obtain functions of ta — 丁 b and 7^4 —jb 
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only. Specifically, if we write 


Mil A — 1^21 S = ® 

M12B — ^22 A = @ 

^13A~^23A = @ 

Ml4S — l^2AB = ④ 

we then find that 

@ -③ = 7^ -7 b 

and 

① -@ -④ + @ = 2 (ta — tb)- 

Other possible designs may use three or more periods rather than two. Examples 
of three-period designs are 



Sequence 



Sequence 

Period 

1 

2 


Period 

1 2 

1 

A 

B 

or 

1 

A B 

2 

B 

A 


2 ! 

B A 

3 

A 

B 


3 

B A 


repeating each sequence r times. For still other designs and more details see Jones and 
Kenward (2003) or Section II. 19.5.8. 

For analysis purposes we may rewrite model 10.39 in matrix form as 


y = fi3 -{- Xpp + X s s + X t t H~ X-y-y + e (10.40) 

using obvious notation and apply the general methods discussed in Chapter 4. More 
precisely, (10.40) is a 5-part linear model and we can write down the full set of NEs 
or obtain the reduced NE for r and 7 and solve for the desired effects. Furthermore, 
we can obtain SS(X r |3, X p , X s , X 7 ) and SS(X 7 |3, X p , X s , X r ) to test hypotheses con¬ 
cerning the direct and residual treatment effects, respectively. 

Lucas (1957) and Cochran and Cox (1957) have pointed out that, because of the 
nonorthogonality of change-over designs, residual effects are estimated less precisely 
than direct effects. To alleviate this problem and to achieve orthogonality between 
direct and residual effects Lucas (1957) suggested to add an extra-period, referred to 
as preperiod, to the basic design, that is, change, for example 
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Sequence 

Period 1 

1 2 

1 

A 

B 

2 

B 

1 

A 

3 

A 

B 


Period 

Sequence 

1 2 

0 

A B 

1 

A B 

2 

B A 

3 

A B 


without taking observations in period 0, the preperiod. In the augmented design each 
combination of direct and residual effects occur exactly once (or r times for the entire 
design), namely (ta-^a), {ta-1b), {j b .,^a)-, {tb-Ib), implying orthogonality. This 
in turn simplifies the analysis as sketched above. 


10.7.2 Change-Over Designs for More than 
Two Treatments 


For the general case of ^(> 2) treatments in a change-over design Williams (1949, 
1950) showed how Latin squares can be used to obtain what he called balanced resid¬ 
ual effects designs. The basic properties of these designs are that each treatment occurs 
the same number of times, Ai say, and each treatment is preceded by every other treat¬ 
ment the same number of times, say 入 2 (this is actually a special case of the more 
general definition of a balanced repeated measurement design given by Hedayat and 
Afsarinejad, 1975). These designs consist of one cyclic Latin square if t is even and of 
two cyclic Latin squares if t is odd. 

EXAMPLE 10.8: For t = 6 the design with t periods and t subjects is as follows 
(designating the treatments by numbers rather than Latin letters): 


Period 

Subject 

1 

2 

3 

4 

5 

6 

1 

1 

2 

3 

4 

5 

6 

2 

6 

1 

2 

3 

4 

5 

3 

2 

3 

4 

5 

6 

1 

4 

5 

6 

1 

2 

3 

4 

5 

3 

4 

5 

6 

1 

2 

6 

4 

5 

6 

1 

2 

3 


This example illustrates the general method of constructing these designs. For subject 
1 the treatments 1 . 2,.... t/2 occur in the periods 1,3,..., t — 1, respectively, and the 
treatments t/2 + 1. 1/2 + 2,... ,t occur in the periods t,t-2, .., ,2, respectively. The 
assignments for the remaining subjects are obtained by simply adding 1.2,... ,t — 
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1 to the treatments for subject 1 (with reduction modulo t) for subjects 2,3, 
respectively. 

For t odd we use two cyclic Latin squares for 2t subjects and t periods. 
Example 10.9: For t = 5 the design is as follows: 


Period i 

Subject 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 


1 

2 

3 

4 

5 

2 

3 

4 

5 

1 

2 

5 

1 

2 

3 

4 

3 

4 

5 

1 

2 

3 

2 

3 

4 

5 

1 

1 

2 

3 

4 

5 

4 

4 

5 

1 

2 

3 

4 

5 

1 

2 

3 

5 

3 

4 

5 

1 

2 

5 

1 

2 

3 

4 


Here treatments 1,2,.... (^H-1)/2 occur in periods 1,3,... ? t, respectively, and treat¬ 
ments (t + 1)/2 -f 1, (t + 1)/2 + 2,... ,iin periods t - l,t - 3,...,2, respectively, 
for subject 1. The assignments for subjects 2,3,.... i are obtained through a cyclic 
development of the arrangement for subject 1 as described previously. The arrange¬ 
ment for subject t + 1 is the mirror image, that is, reverse order, of the arrangement 
for subject t, and the assignment for the remaining subjects is again obtained through 
cyclic development. For a more general discussion see Section II. 19.5.1. 

In the behavioral science literature these designs are often referred to as completely 
counterbalanced Latin squares (Wagenaar, 1969) (see also Section 13.4). This does not 
mean, however, that for these designs the direct and residual effects are orthogonal to 
each other. Orthogonality can be achieved, as discussed earlier, by adding a preperiod 
with the same treatment arrangement as in period 1, so that every treatment is also 
preceded A 2 times by itself. With or without the preperiod an appropriate model is of 
the form of (10.39). 

10.7.3 Some Variations and Extensions 

There exist, obviously, many variations and extensions of these designs. For example 
we may have p, the number of periods, less than t, the number of treatments. This may 
occur when the number of treatments is large and, because of fatigue, not each partic¬ 
ipant can be assigned each treatment, or assigning each treatment to each participant 
may simply take too much time, in particular if a sufficient number of participants is 
available for the experiment. In this situation the subjects may represent some form 
of incomplete block, and the basic building block may be incomplete Latin squares 
(Patterson, 1950, 1951 ， 1952). Afsarinejad (1990) has extended the algorithm for the 
Williams designs (Examples 10.8 and 10.9) to construct balanced designs for the case 
p < t (see also Section II. 19.5.2). We shall not provide details here, but give the 
following example. 







10.7. CHANGE-OVER DESIGNS 


403 


Example 10.10: For t = 5 andp = 3 the following design is a balanced design with 
10 subjects in which each treatment is preceded and followed by every other treatment 
exactly once: 


Period 

Subject 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

1 

1 

2 

3 

4 

5 

3 

4 

5 

1 

2 

2 

5 

1 

2 

3 

4 

5 

1 

2 

3 

4 

3 

3 

4 

5 

1 

2 

1 

2 

3 

4 

5 


We may not have enough subjects for a balanced design as described above. This 
has led to the development of partially balanced designs (Patterson and Lucas, 1962) 
using PBIB designs for purposes of construction (see also Blaisdell and Raghavarao, 
1980) (see Sections II. 19.5.2 and 19.5.3). 

An alternative design, proposed by Balaam (1968)，uses only two periods but t 2 
subjects for t treatments. The basic idea is to assign all t(t - 1) ordered pairs of 
treatments to 亡 (1 — 1) subjects for periods 1 and 2 and use t subjects receiving the same 
treatment in both periods, as illustrated in the following example. 

Example 10.11: For t = 3 the Balaam (1968) design is given by 



Subjects 

Period 

1 

2 

3 

4 

5 

6 

7 

8 

9 

1 

A 

A 

B 

B 

C 

C 

A 

B 

C 

2 

B 

C 

C 

A 

A 

B 

A 

B 

C 


An extension of the designs discussed so far leads to designs balanced for second 
order residual effects (Williams, 1949, 1950). In this case we consider carry-over ef¬ 
fects not just from the immediately preceding treatment but also from the treatment 
applied two periods prior to the present application. For such a situation model (10.40) 
needs to be amended by an additional term representing the second order residual ef¬ 
fect. Obviously the construction of such designs becomes more complicated. Williams 
(1949,1950) showed, for example, how mutually orthogonal Latin squares can be used 
to achieve the goal. 


EXAMPLE 10.12: For t = 4, p = 4 and s = 12 subjects Williams gives the following 
design: 
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Period 


Subjects 



Subjects 



Subjects 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

1 

4 

3 

2 

1 

4 

3 

2 

1 

4 

3 

2 

1 

2 

1 

2 

3 

4 

3 

4 

1 

2 

2 

1 

4 

3 

3 

2 

1 

4 

3 

1 

2 

3 

4 

3 

4 

1 

2 

4 

3 

4 

1 

2 

2 

1 

4 

3 

1 

2 

3 

4 


The distinctive feature of this design is that each treatment is preceded exactly once 
by each ordered pair of treatments. For more details see Section II.19.8.5. □ 


10.8 EXAMPLES USING SAS® 


Example 10.13: Consider an agricultural experiment with t = b treatments in a 
Latin square design layout due to fertility gradients in two directions. The design and 
the data are given in Table 10.9a. 


Treatment 1 represents a control and the main objective is to compare treatments 
2, 3, 4, 5 versus treatment 1. The input statements using SAS PROC GLM are given 
in Table 10,9a. In addition to considering the comparison of treatment 1 versus the 
average of the remaining four treatments, we want to perform Dunnett’s procedure 
(see Section 7.5.7). 

The results are given in Table 10.9b: 

(i) The overall treatment differences are significant at P = .0523. 

(ii) Dunnetfs procedure shows treatments 2 and 5 are clearly significantly different 
from treatment 1 (P = .0459 and P = .0218, respectively), whereas treatment 
3 is marginally significantly different from treatment 1 (P = .1138) 

(iii) Having specified a = .10, 90% simultaneous confidence intervals for Ti — T\ 
(i = 2, 3, 4, 5) are provided. 

(iv) The estimate for (T 2 + 了 3 + T 4 + — is 20.15, indicating that on the average 

the new treatments provide a higher yield than the control. □ 


Example 10.14: A multi-farm trial was performed to evaluate the effectiveness of 
different doses, low (L), medium (M), high (H), of a food additive on growth of cattle. 
In addition to the three doses a control (C) was included in the experiment. The inves¬ 
tigator decided to block on farms and weight classes, leading to a 4 x 4 Latin square 
design. Two breeds were included in the experiments, using four farms for each breed. 
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Table 10.9 Latin Square Design 


a) Input statements: 

data weight; 
data LatinS q; 
input row column trt y 
datalines; 

1 1 1 94 1 23 100 1 3 498 142 101 1 55 112 

2 1 3 103 222 111 23 1 51 24 5 1102 5 4 90 

3 1 4 114 3 2 1 75 3 3 5 94 3 4 3 85 3 5 2 107 

4 1 5 100 4 2 4 74 4 3 2 70 4 4 1 93 4 5 3 106 
512 106 525 95 53381544 90 551 73 

run; 

proc glm data=LatinSq; 
class row column trt; 
model y=row column trt; 

lsmeans trt/stderr pdiff cl adjust=Dunnett alpha=.10; 

estimate ’ 1 vs (2+3+4+5)’ trt -4 1 1 1 1 /divisor=4; 

title 1 ’LATIN SQUARE DESIGN’； 

title2 ’ANALYSIS OF VARIANCE，； 

title3 ^/POST-HOC ANALYSIS’； 

run; 

b.) Output: 


LATIN SQUARE DESIGN 
ANALYSIS OF VARIANCE 
W/FOST-HOC ANALYSIS 

The GLX Procedure 
Class Level 二 reformation 
Class Levels Values 

row 5 12345 

column 5 12345 

trz 5 1 2 3 4 5 


Number cf Observations Read 25 

Number of Observations Used 25 


Dependent Variable : y 


Source 

DF 

Surr. of 
Squares 

Mean Square 

F Value 

?r > F 

Model 

12 

4094.720000 

341.226567 

2.34 

0.C774 

Error 

12 

1748.7200C0 

145.726667 



Corrected Total 

24 

5843.440000 





R-Square 

Coeff Var 

Root MSE 

y Mean 



0.700738 


12.93584 


12 .C7173 


93.32000 
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Table 10.9 {Continued) 


Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

row 

4 

514.240000 

128.560000 

0.88 

0.5033 

column 

4 

1711.440000 

427.860000 

2.54 

0.0661 

trt 

4 

1869.C40C00 

467.260000 

3.21 

0.0523 


Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

row 

4 

514.240000 

128.560000 

0.88 

0.5033 

column 

4 

1711.440000 

427.860000 

2.94 

0.0661 

trt 

4 

1869.040000 

467.26000C 

3.21 

0.0523 


Least Squares Means 

Adjustment for Multiple Comparisons : Dunnett 


HO : LSMear.= 


trt 

y LSMEAN 

Standard 

Error 

HO:LSMEAN=0 
?r > |t| 

Control 
Pr > 11 ! 

1 

77.200000 

5.398642 

<.00C1 


2 

99.000000 

5.398642 

<.0001 

0.0459 

3 

35.000000 

5.398642 

<.0C01 

0.1138 

4 

93.200000 

5.398642 

<.0C0I 

0.1680 

5 

1C2.20C00G 

5.398642 

<•0001 

0.C218 


trt y LSMEAN 90% Confidence Limits 


2 

3 

4 

5 


77.200000 

99.000000 

95.000000 

S3.2G0000 

102.200000 


67.578068 

89.378068 

85.378068 

83.578068 

92.578068 


86.821932 

108.621932 

104.621932 

102.821932 

111.821932 


Least Squares Means for Effect trt 


Difference 

Between 

Means 


Simultaneous 90% 
Confidence Limits for 
LSMean(i)-LSMean(j) 


2 

3 

4 

5 


21.800000 

3 

17.800000 

-0 

16.000000 

-2 

25.000000 

6 


415303 40.184697 
584697 36.184697 
384697 34.384697 
615303 43.384697 


Dependent Variable : y 


Parameter 


Standard 

Error t Value Pr > 11 


1 vs (2+3+4+5) 


20.1500000 


6.03586503 


3.34 


0.0059 



10.8. EXAMPLES USING SAS® 


407 


Thus, we have a replicated LS design with farms nested in breeds and weight classes 
crossed with breeds. 

The design and the data are given in Table 10.10a and the results of the analysis, 
using SAS PROC GLM, are given in Table 10.10b: 

(i) Differences among dosages and interaction between breed and dosage were highly 
significant (P < .0001). 

(ii) The LS means for breed*dosage indicate that for both breeds growth remains 
nearly the same for C, L, M ，but increases for H, with a substantially higher 
increase for breed 2, leading to the significant interaction. The interaction is, 
however, co-directional. Hence it is meaningful to assert that on average, dosage 
H leads to higher growth by about 10 kg, 153 vs 163. 

(iii) The slice operator performs separate analyses (but using the same error term, 
MS(Error ) 二 1.93) and concludes that for both breeds the differences among 
dosages are highly significant, due, of course, to the performance of H. □ 

Example 10.15: Consider the following crossover design with t = S treatments and 
p = 3 periods. There are six possible sequences of assigning the treatments over the 
three periods used in the experiment. Each sequence is replicated twice. The plan and 
the data are given in Table 10.11a. 

We use SAS PROC GLM to analyze the data. The input statements are given in 
Table 10.11a. Since for the observations in period 1 there are carry-over effects we put 
a “0” in the column for carry-over effects. Since this will result in “non-estimable” 
LS means for treatments we follow Ratkowsky, Evans and Alldredge (1993) to insert 
the statement “if carry = ‘0’ then carry = ‘3’ ’’ to alleviate this problem (we shall 
comment on this below). 

The analysis of the data is given in Table 10.1 lb: 

(i) The general form of estimable function shows that differences between treatment 
and carry-over effects are estimable. 

(ii) There are highly significant differences among the treatments (P < .0001). 

(iii) Differences among carry-over effects are not significant (P = .41). 

(iv) The coefficients for trt and carry LS means show how those LS means are ob¬ 
tained from the solution vector (not shown here). 

(v) The treatment LS means together with their standard errors: 

56.71 士 1.05 
52.69 士 1.05 
47.14 土 .85 
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Table 10.10 Replicated Latin Square Design 


a) Input statements: 


data repLS 

input breed farm wclass dosage $ response 
datalines; 

1 1 1 H 142 1 1 2L 144 1 1 3C 148 11 4M 150 
l 2 1 M 138 1 2 2 C 143 1 2 3 L 145 1 2 4 H 154 
1 3 1 C 138 1 3 2 M 144 1 3 3 H 153 1 3 4 L 149 

1 4 1 L 139 1 42 H 149 1 43 M 144 1 44C 150 

2 1 1 L 158 2 1 2 C 161 2 1 3 M 165 2 1 4 H 180 
22 1C 155222L 160223H 17822 4M 167 
2 3 1 M 158 2 3 2 H 175 2 3 3 L 163 2 3 4 C 167 
24 1 H 174 2 4 2M 161 24 3 C 164 24 4 L 168 


run; 

proc glm data=repLS; 
class breed farm wclass dosage; 

model response: breed farm(breed) wclass dosage breed*dosage; 

lsmeans dosage/stderr; 

lsmeans breed*dosage/stderr slice=breed; 

title 1 ’REPLICATED LATIN SQUARE DESIGN'; 

title2 ’ (r=2, t=4, Rows Nested in Replications )’； 

title3 ’ANALYSIS OF VARIANCE ’； 

run; 

b.)Output: 


REPLICATED LATIN SQUARE DESIGN 
(r=2, t=4. Rows Nested in Replications) 
ANALYSIS OF VARIANCE 

The GLM Procedure 

Class Level Information 


Class 

Levels 

Values 

breed 

2 

1 

2 


farm 

4 

1 

2 

3 4 

wclass 

4 

1 

2 

3 4 

dosage 

4 

C 

H 

L M 


Number of Observations Read 32 

Number of Observations Used 32 


Dependent Variable : 

response 

Sum cf 




Source 

DF 

Squares 

Mean Square 

F Value 

Pr > F 

Model 

16 

4470.250000 

279.390625 

140.87 

<.0001 

Error 

15 

29.750000 

1.983333 



Corrected Total 

31 

45C0.00C00G 
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Table 10.10 {Continued) 


R-Square 

Coeff Var 

Root MSE 

response Mean 

0.993389 

0.904211 

i.408309 

155.7500 


Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

breed 

1 

3280.500000 

328C .500000 

1654.03 

<.0001 

farm(breed) 

6 

9.00C000 

1.500000 

0.76 

0.6146 

wclass 

3 

466.750000 

155.583333 

78.45 

<.0001 

dosage 

3 

580.250C00 

193.416667 

97.52 

<.0001 

breed*dosage 

3 

133.7500CO 

44.583333 

22.48 

<.C001 


Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

breed 

1 

3280.500000 

3280.500000 

1654.03 

<•0001 

farm(breed) 

6 

S.OOOCOO 

1.500000 

0.76 

0.6146 

wclass 

3 

466.750000 

155.583333 

78.45 

<.0001 

dosage 

3 

580.250000 

193.416667 

97.52 

< . 0 0 C1 

breed^dosage 

3 

133.750000 

44.583333 

22.48 

<•0001 



Least Squares Means 





response 

Standard 




dosage 

1-SHEAN 

Error 

Pr > It 1 



Q 

153.25COOO 

0.4S7S12 

〈 •00C1 



H 

163.125000 

0.497312 

<.00C1 



L 

153.250000 

0.497912 

<.0001 



V 

153.375000 

0.457912 

〈 •C001 



breed 

dosage 

response 

1SMEAN 

Standard 

Error 

Pr > |tl 

1 

c 

144.750000 

0.704154 

<.0001 

1 


149.50C00C 

0.704154 

<.0001 

1 


144.250000 

0.704154 

<•0001 

1 

M 

144.000000 

0.704154 

<.0001 

2 

C 

161.750000 

0.704154 

<.0001 

2 

H 

176.750CO0 

0.704154 

<.000 ： 

2 

L 

162.250000 

0.704154 

<.0001 

2 

M 

162.750000 

0.7C4154 

<.0001 


breed+dosage Effect Sliced by breed for response 


Sum of 
Squares 


Mean Square 


81.250000 

632.750000 


27.083333 

210.916667 


13.66 

106.34 


0.0001 

<.0001 
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Table 10.11 Crossover Design 


a) Input statements: 


data cross; 

input period sequence steer trt carry y 
if carry=‘0’ then carry: ‘3 ’； 
datalines; 

I 1 1 1 050 2 1 1 2 1 61 3 1 1 3 253 

II 21055 21 221 63 31 2 3 257 
1 2 3 204422 33 24232 3 1 3 57 

12 42051 22 432 46 32 413 59 

13 5 3 035 2 3 513 55 33 521 47 

13 63041 23 613 56 33 621 50 

14 710 54 24 731 48 34 72351 

14 810 58 24 8 3 1 51 34 8 2 3 54 

15 920 50 25 912 57 35 931 51 
1 5 10 2 0 55 2 5 10 1 2 59 3 5 10 3 1 55 
1 6 11 3041 26 11 235636 11 1 258 
I 6 12 3 0 46 2 6 12 2 3 58 3 6 12 1 2 61 

run; 

proc glm data=cross; 

class period sequence steer trt carry; 

model y=period sequence steer(sequence) trt carry/e; 

lsmeans trt carry/stderr e; 

estimate ’1-2’ trt 1 -1 0; 

estimate ’1-3’ trt 1 0 -1; 

estimate ’2-3’ trt 0 1 -1; 

title 1 ’CROSSOVER DESIGN ’； 

title2 ? USING COUNTERBALANCED LATIN SQUARES’; 
run; 

b.) Output: 


CROSSOVER DESIGN 

USING COUNTERBALANCED LATIN SQUARES 
The GLM Procedure 
Class Level Information 


Class 

Levels 

Val- 

-es 

period 

3 

1 

2 

3 

sequence 

6 

1 

2 

3 

steer 

12 

1 

2 

3 

trt 

3 

1 

2 

3 

carry 

3 

1 

2 

3 


1 12 


Nurr.ber of Observations Read 


36 
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Table 10.11 {Continued) 


General Form of Estimable Functions 


Effect 


Coefficients 

Intercept 


LI 

period 

i 

L2 

period 

2 

L3 

period 

3 

L1-L2-L3 

sequence 

1 

L5 

sequence 

2 

L6 

sequence 

3 

L7 

sequence 

4 

L8 

sequence 

5 

L9 

sequence 

6 

L1 _ L5_L6-L7 - L8-L9 

steer(sequence) 

11 

Lll 

steer(sequence) 

2 1 

L5-L11 

steer(sequence) 

3 2 

L13 

steer(sequence) 

4 2 

L6-L13 

steer(sequence) 

5 3 

L15 

steer(sequence) 

6 3 

L7-L15 

steer(sequence) 

7 4 

L17 

steer(sequence) 

8 4 

L8-L17 

steer(sequence) 

9 5 

L19 

steer(sequence) 

10 5 

L9-L19 

steer(sequence) 

11 6 

L21 

steer(sequence) 

12 6 

L1 _ L5-L6 - L7-L8 _ L9 _ L21 

trt 

1 

L23 

trt 

2 

L24 

trt 

3 

L1-L23-L24 

carry 

i 

L2 6 

carry 

2 

L27 

carry 

3 

L1-L26-L27 


Dependent Variable : y 


Source 


BF 

Sum of 
Squares 

Mean 

Square 

F Value Pr > F 

Model 


17 

1302.513889 

76, 

,618464 

8.74 <.0001 

Error 


18 

157.791667 

8. 

,766204 


Corrected 

Total 

35 

1460.305556 





R-Square 

Coeff 

Var Root 

MSE 

y 

Mean 


0.891946 5.654535 2.960778 52.36111 
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Intercept 

period 

period 



sequence 

sequence 

sequence 

sequence 

sequence 

sequence 

steer(sequence) 

s~eer(sequence) 

steer(sequence) 

steer(sequence) 

szeer (sequence) 

steer(sequence) 

steer(sequence) 

steer(sequence) 

steer(sequence) 

steer(sequence) 

steer(sequence) 

steer(sequence) 

trt 

trt 

trt 

carry 

carry 

carry 


C.33333333 
0.33333333 
0.33333333 
0.16666667 
0.16666667 
0.16666667 
0 . 1666666 "? 
0.16666667 
0.16666667 
0.08333333 
0.C8333333 
0.08333333 
0.C8333333 
0.08333333 
0.08333333 
0.08333333 
0.08333333 
C.08333333 
0.08333333 
0.08333333 
0.C8333333 
1 
0 
0 

0.33333333 
0.33333333 
C.33333333 


0.33333333 
0.33333333 
0.33333333 
0.16666667 
0.16666667 
0.16666667 
0•16666667 
0.16666667 
0.16666667 
0.08333333 
0.08333333 
C.08333333 
0.08333333 
0.08333333 
0.08333333 
0.08333333 
C.08333333 
0.08333333 
0.0B333333 
0.08333333 
0.08333333 
0 
1 
0 

0.33333333 

0.33333333 

0.33333333 


1 

0.33333333 
0.33333333 
0.33333333 
C.16666667 
0.16666667 
0.16666667 
0.16666667 
0.16666667 
C.16666667 
0.C8333333 
C _ 08333333 
0.08333333 
0.08333333 
0.08333333 
0.C8333333 
C.08333333 
0.08333333 
0.08333333 
0.08333333 
0.08333333 
0.08333333 
0 
0 
1 

0.33333333 

0.33333333 

0.33333333 


Table 10.11 (Continued) 


Source 

D? 

Type I SS 

Mean Square 

F Value 

Pr > F 

period 

2 

292.0555556 

146.0277778 

16.66 

<.0C01 

sequence 

5 

326.4722222 

65.2944444 

7.45 

0.0006 

steer(sequence) 

6 

118.5000000 

19.7500000 

2.25 

0.0849 

trt 

2 

549.0555556 

274.5277778 

31.32 

<•0301 

carry 

2 

16.43C5556 

8.2152778 

0.94 

0.4100 


Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

period 

2 

172.3072917 

86.1536458 

9 

.83 

0.0013 

sequence 

5 

318.6916667 

63.7383333 

7 

.27 

0.0007 

sreer(sequence) 

6 

118.5000000 

19.7500000 

2 

• 25 

C . 0849 

trt 

2 

440.6083333 

220.3041667 

25 

.13 

<.0001 

carry 

2 

16.4305556 

3.2152778 

0 

. 94 

0.4100 


Coefficients for trt. Least Square Means 
trz Level 

Effect 123 
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Table 10.11 {Continued) 


trt 

y LSMEAN 

Standard 

Error 

Pr > !ti 

1 

56.7083333 

1.0467929 

<. 0 C 0 i 

2 

52.68750CO 

1.0467929 

<.0001 

3 

47.1666667 

0.8547029 

<.0C0i 



Coefficients 

for carry Least Square Means 




carry Level 



Effect 


1 

2 

3 

Intercept 


1 

1 

1 

period 

1 

0.33333333 

C.33333333 

0.33333333 

period 

2 

0.33333333 

0.33333333 

C.33333333 

period 

3 

0.33333333 

0.33333333 

0.33333333 

sequence 

1 

G•16666667 

G.16666667 

0.16666667 

sequence 

2 

C.16666667 

C.16666 667 

0.16666667 

sequence 

3 

0.16666667 

0.16666667 

0.16666667 

sequence 

4 

0.16666667 

0.16666667 

0.16666667 

sequence 

5 

0.16666667 

0.16666667 

0.16666667 

sequence 

6 

0.16666667 

0.16666667 

0.16666667 

steer(sequence) 

1 1 

0.08333333 

0.08333333 

0.03333333 

steer(sequence) 

2 1 

0.08333333 

0.08333333 

0.08333333 

steer(sequence) 

3 2 

0.08333333 

0.08333333 

0.08333333 

steer(sequence) 

4 2 

0.08333333 

0.C8333333 

0.0S333333 

steer(sequence) 

5 3 

0.08333333 

0.08333333 

0.08333333 

steer(sequence) 

6 3 

0.08333333 

0.08333333 

0.08333333 

steer(sequence) 

7 4 

0.08333333 

0-08333333 

0.08333333 

steer(sequence) 

8 4 

0.08333333 

C.08333333 

0.08333333 

steer(sequence) 

9 5 

C .08333333 

C■08333333 

0.08333333 

steer(sequence) 

10 5 

0.08333333 

C. 08333333 

0.08333333 

steer(sequence) 

11 6 

0.08333333 

0.08333333 

0.08333333 

steer(sequence) 

12 6 

0.08333333 

0.08333333 

C.08333333 

trt 

1 

0.33333333 

0.33333333 

0.33333333 

trt 

2 

0.33333333 

0.33333333 

0.33333333 

trt; 

3 

0,33333333 

0.33333333 

0.33333333 

carry 

1 

1 

0 

0 

carry 

2 

0 

1 

0 

carry 

3 

0 

0 

1 


carry 

y LSMEAN 

Standard 

Error 

Pr > |t! 

1 

53.0833333 

1.3514039 

<.00C1 

2 

50.7708333 

1.3514039 

<.0001 

3 

52.7083333 

0.8547029 

<.0001 


Dependent Variable : y 


Parameter 

Estimate 

Standard 

Error 

t Value 

Pr > It) 

1-2 

4.02083333 

1.35140388 

2.98 

0.0081 

1-3 

9.54166667 

1.35140388 

7.06 

<.0001 

2-3 

5.52083333 

1.35140388 

4.09 

0.000 
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intuitively confirm the differences among the treatment effects. The values for 
the LS means are, however, not unique because of the arbitrary choice to replace 
carry = 0 by carry 二 3 (see above). This also affects the standard errors (that is, 
1.05 vs. .85). In spite of this, differences between LS means are unique. 

(vi) Differences between treatment are all significantly different from zero. And the 
estimates of those differences have the same standard error. □ 

10.9 EXERCISES 

10.1 A marketing expert for a publishing house wants to measure reader preference 
for three different covers of the same paperback novel. She has chosen 10 cities 
and 3 newsstands in each city which are going to sell the novel. She wants to use 
one of two experimental setups described below. 

(a) In each city each cover is assigned randomly to one of the 3 newsstands. 
The number of books sold during a three-week period following the assign¬ 
ment is used to compare the effect of the covers on sale of the novel. 

(b) In each city each of the 3 newsstands will sell the book using each cover 
for one week (that is, the trial extends over 3 weeks) in such a way that 
during a given week the 3 newsstands in a city will display the book with 
a different cover. The same 3-week period will be used in all cities. Sales 
figures for each week will be used for the analysis. 

For each of the two scenarios described above: 

(i) Give the name of the experimental design used. 

(ii) Identify the experimental units. 

(iii) Give the model for each of the designs and the ANOVA table, including 
sources of variation and d.f. 

(iv) Indicate how you would test whether the covers had the same effect on 
sales. 

(v) Which of the two designs would you prefer in this situation and why? 

10.2 A study is planned to investigate (a) whether four gasoline additives differ with 
respect to the reduction in oxides of nitrogen and (b) if such differences exist 
whether they depend on the makes of the cars used in the study. 

The investigator has selected three makes (models) of cars, Ford ， Honda, and 
Porsche. For each model he has four cars available, and he uses four different 
drivers. He believes that for each model systematic differences are likely to occur 
in the cars’ performance. Also, even though the drivers may do their best to drive 
the car in a manner required by the test, systematic differences can occur from 
driver to driver. 
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In planning the study the investigator would like to have an experimental 
design that eliminates the car-to-car variation and the driver-to-driver variation. 
He wants to use the same four drivers for the whole experiment. 

(i) Give the name for an appropriate experimental design. 

(ii) Write out the actual assignment of the additives to car-driver combinations 
for the three models. 

(iii) Give a linear model and outline the ANOVA table for this experiment, giv¬ 
ing sources of variation and d.f. 

(iv) Give the SAS statements (classes, model) for the model in (iii). 

(v) Indicate how you would use the ANOVA to investigate the questions (a) 
and (b) raised above. 

10.3 Suppose a poultry scientist comes to you to help him set up an experiment. He 
wants to compare the effects of 3 different diets (treatments) on eggshell prop¬ 
erties. He has available 6 strains of chickens. Each chicken included in the 
experiment will be housed in a separate pen during the duration of the trial. He 
has 30 pens available which are arranged in stacks of 5 side-by-side (see diagram 
below). 


For each chicken, measurements are taken on 5 randomly selected eggs. 

(i) What kind of experimental design would you use? Give its parameters (that 

is, t, 6, etc.)? 

(ii) Give a suitable arrangement of the diets (A, B, C) to the chickens. 

(iii) For the design proposed in (ii) give an appropriate linear model and outline 
the ANOVA, giving sources of variation and d.f. 

(iv) Upon further questioning you find out that the height of the pen in the 
stack may have an effect on the outcome of the experiment (because of 
differences in the temperature). Would you change the arrangement of the 
treatments given in (ii)? If your answer is “no”，give reasons for it; if your 
answer is “yes”，give the new arrangement. 
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(v) For the new situation described in (iv) give an appropriate linear model and 
outline the ANOVA giving sources of variation and d.f. 

10.4 A paint company wants to compare the abilities of four (4) white house paints 
to withstand environmental conditions. Eight (8) square houses, each with one 
side facing exactly north, are available in each of three states: Florida, Michigan, 
and California (that is, there are 24 houses altogether). Each side of a house is 
possibly exposed to different types of weather. Also, the houses are different 
from each other (because of different building materials, different ages, etc.). 
The company wants to paint each side of each house with a different paint. In 
addition to comparing the 4 paints the company is also interested in finding out 
whether differences among the paints vary from state to state. 

(i) Describe how you would set up the experiment, that is, what error-control 
design would you use. Explain the reasons for choosing the design. 

(ii) Give an appropriate linear model for analyzing data from the experiment 
described in (i). 

(iii) Outline the ANOVA table associated with the model given in (ii) (giving 
source of variation, d.f.) and indicate how you would investigate the ques¬ 
tions the company is interested in. 

(iv) Describe how you would perform the analysis in (iii) by using SAS or some 
other statistical package. 

10.5 Derive the missing value formula (10.25). 

10.6 Obtain the ANOVA table for the analysis of covariance for the Latin square de¬ 
sign. 

10.7 Derive expressions (10.26) and (10.27) for the estimated relative efficiencies of 
the Latin square design relative to the RCBD. 

10.8 Extend model (10.30) to include replicate x treatment interaction and obtain the 
ANOVA table for this model. 

10.9 A Grasco-Latin square in its pure form may not be very useful, but extension and 
replications of it often prove to be quite useful. Consider the following design: 


Row i 

Column 

1 

2 

3 

4 

5 

6 

7 

8 

1 

Cl 

Aa 

BS 

A(3 


C5 

Da 

D0 

2 

Aa 

B,d 

Dj3 

CS 

A5 

Dj 


Ca 

3 

D5 

Cl 

A^f 

Ba 

Da 

Ap 

C,d 

B6 

4 

B0 

D5 

Ca 

D，{ 

CQ 

Ba 

A6 

A^y 


where rows, columns and Greek letters are blocking factors. 
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(i) What would you call this design? Give an appropriate linear model and 
outline the ANOVA table, giving sources of variation, d.f” and sums of 
squares. 

(ii) Give the SAS statements for analyzing data from such an experiment. 

(iii) Suppose you find out that for the experiment under consideration the “columns’ 
represent animals. More specifically, columns H represent animals from 
one breed and columns 5-8 animals from a different breed. The researcher 

is interested in finding out whether differences among treatments are breed- 
specific. 

Is the design given above appropriate for investigating this question? If yes, 
explain why; if no, indicate what you would have done differently. 

(iv) For the design in (iii) give an appropriate linear model and outline the 
ANOVA table, giving sources of variation and d.f. 

(v) Give the SAS statements for the analysis suggested in (iv). 

10.10 Write out a linear model for the error-control design using the 4x4 completely 
orthogonalized square (Section 10.6.2) and obtain the ANOVA table for this de¬ 
sign. 

10.11 Suppose at x t completely orthogonalized square is replicated r times. Write 
out one possible linear model for such an error-control design and obtain the 
associated ANOVA table. 

10.12 Using the data of Example 10.15 show numerically that differences between 
treatment LS means do not depend on the choice of x in “if carry=‘0’ then 
carry=‘ ； r’ 

10.13 Analyze the data of Example 10.15 without carry-over effects and compare the 
results with those of Table 10.11b. 
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CHAPTER 11 

Factorial Experiments: 

Basic Ideas 

11.1 INTRODUCTION 

Our discussion so far has centered on error-reduction designs. As we have pointed 
out earlier (see Chapter 2) ， however, another component of experimental design is the 
treatment design. Here we shall be concerned especially with the situation where the 
treatments have a structure, more specifically, a factorial structure. 

Suppose we have several factors, denoted by A, B. C,.... Each factor has a number 
of different expressions, or levels. For example, factor A may be “insecticide” and the 
levels represent different commercially available insecticides, labeled ai, a 2 ,..., a a ; 
factor B may be “amount of insecticide” with the levels representing the specific 
amounts, say 1.2.3,..units, generally denoted by bi, … 5 factor C may be 
“type of application” with the levels “manually” and “mechanically,” generally de¬ 
noted by ci, C 2 ,.... c c . In this example the levels of factors A and C are qualitative, 
whereas the levels of factor B are quantitative. A treatment now consists of level com¬ 
binations, one level from each factor, which we denote by (a^jCfc). These treatments 
are then applied in any of the error-control designs we have discussed earlier. The ob¬ 
ject is not so much to compare the treatments as such but to make statements about 
the “behavior” of the various factors, singly or jointly. We may ask, for example, “Is 
there a difference among the insecticides generally, that is, averaged over the levels of 
factors B and CT\ or “Do the differences in the efficacies of the insecticides depend 
on the type of application?” The first question is one about the main effects of factor 
A, whereas the second question is concerned with the interaction between factors A 
and C. It is these types of questions and the fact that we can provide answers to them 
that make factorial experiments particularly valuable. 

Factorial experiments can be used in various forms (for instance, Kempthome, 
1952): One procedure would be to estimate the effect of, say, factor A, keeping all 
the other factors at a constant level in one experiment; then estimate the effect of factor 
A after changing the level of factor B, and keeping the remaining factors at a constant 
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level in the next experiment; and so on. This procedure of varying one factor at a time 
would generally be used when the purpose is to establish a fundmental law as it would 
lead to detailed knowledge of the effect of one factor when the others are held constant. 
No information is, however, obtained on the dependence of the effects of a factor on 
the levels at which the other factors were held constant. To obtain such information 
we might use another experimental procedure, namely to vary the levels of each of the 
factors and consider all possible level combinations simultaneously. This would allow 
us to obtain information about main effects and, more importantly, about interactions 
among the various factors. These ideas and their practical applications in agronomic 
experimentation and more generally in scientific experimentation were introduced by 
Fisher (1935) and Yates (1937). 

The value of factorial experiments lies in the fact that we look at several factors 
simultaneously which allows us to estimate the various effects and interactions and at 
the same time provides us with a wider inductive basis, that is, drawing conclusions 
over a wide range of conditions. Fisher (1935) has referred to this property as greater 
comprehensiveness. And even though, in general, we can estimate all possible inter¬ 
actions among factors, it is an empirical fact that the interactions among many factors 
(the so-called higher order interactions) are negligible for all practical purposes. This 
leads to a considerable reduction in the number of parameters, that is, main-effects and 
lower order interactions, and hence to an easier interpretation of data from a factorial 
experiment. It is for these reasons that factorial experiments are used widely in scien¬ 
tific and industrial experimentation. In the following sections we shall present some 
basic ideas about certain types of factorial experiments. A much more detailed and 
technical discussion will be given in Chapters II.7-16. 


11.2 INFERENCES FROM FACTORIAL 
EXPERIMENTS 

Suppose we have n factors A\ 9 .. • ， A n> where factor Ai has rrii levels an, a ^， 

..•，= 1,2,..., n). A treatment combination is denoted by (auci 2 jCi 3 k ... a n i) 
and there are such treatment combinations. We write the effect of a treatment 

combination as 丁讲 ...1 and define the various main effects and interactions through an 
expansion of the type as illustrated for n = 3: 

丁 ijk = = T... + ( 亍 i. ， — 亍 …) + ( 亍 .j.- 亍 … ) 

+ (fij. -fi..- Tj. + T..) + ( 亍 “k — 于 …) 

+ (n.k - fi,. - f.fc + f..) 

+ {〒.jk — 亍 .j. — + f_") 

+ _ 丁 ij. _ 丁 i.k _ 丁 .jk + 丁 i.. + 下 , j. + 丁 .，k — 〒 …) . (11.1) 

This is, of course, an identity in 丁啡 which gives rise to a model statement of the form 

丁 ijk = " + A-u + A2j + {A\A2)ij + Ask 

+ (AiA3)ifc + ( 乂 2 乂 3 )并 + (11.2) 
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where /i represents the overall mean; Au ， ^ 3 ^ represent the main effects as¬ 
sociated with the factors Ai, A 2 , A^\ (AiA 2 )ij, (AiAs)ik, ( 乂 2 乂 3 )作 represent the 
two-factor interactions associated with those factors, and (A\A 2 As)ijk represents the 
three-factor interaction. From (11.1) it follows immediately that 


mi 

i=l 


m2 m3 

=H Ao -i = A3k = 0 

j=l k=l 


mi 

y^(AiA 2 )ij 


m 2 

= 〉 ~ 0 ， etc. 

3=1 


and 

mi m2 m3 

^ : ( 乂 1^~2^^3)。./0 ~ 〉 A2A , ^)ijk — > : ( 乂 1 乂 2 乂 3)ij/c = 0. 

i=l j=l k=l 

We have thus exploited the factorial structure of the treatments and decomposed the 
treatment effects into meaningful components. Of these the main effects and two-factor 
interactions (or first order interactions) are of major importance for the interpretation 
of data from such an experiment. The existence of higher order interactions of appre¬ 
ciable magnitude (relative to the main effects) makes the interpretation, unfortunately, 
much more difficult. As mentioned earlier, it is, however, an established fact that the 
importance, that is, magnitude, of higher order interactions tends to decrease as the 
number of factors involved increases (somewhat analogous to a Taylor series expan¬ 
sion). This fact will actually be exploited in later chapters (see also Chapters II.8-16) 
to construct useful incomplete block designs for factorial experiments. 

Model (11.2) leads to a corresponding partitioning of the treatment sum of squares 
into main effect and interaction sums of squares as follows: 


Source d.f. 


Treatments 

m i m2 m3 — 1 

a 

mi — 1 


m2 - 1 

A\ x A 2 

(mi - l)(m 2 - 1 ) 

M 

m3 — 1 

A\ x 

(mi - l)(m 3 - 1 ) 

A 2 X A3 

(m 2 - 1)(7773 - 1) 

Ai x A 2 x A 3 

(mi - l)(m 2 - l)(m 3 - 1) 


If the error-control design is an orthogonal design (CRD, RCBD, GRBD ， LSD) then 
the various components in ( 11 . 2 ) are estimated by replacing in ( 11 . 1 ) the by 
the corresponding treatment means. The partitioning of the treatment sum of squares 
is then obtained by squaring each term on the right-hand side of ( 11 . 1 )，with the t’s 
replaced by the treatment means, and summing each term over all subscripts. Tests of 
hypotheses are performed in the usual manner by using MS ( 五 ) from the error-control 
design as the denominator in the F-statistic and the individual main effect or interaction 
sums of squares in the numerator. 
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For nonorthogonal designs (for example, incomplete block designs) we may set up 
a correspondence between ordinary treatments and factorial treatments to assign the 
treatment combinations to the various blocks. To estimate the main effects and interac¬ 
tions we replace in (11.1) the Tijk shy the corresponding LS means. The (partial) sums 
of squares associated with main effects and interactions must be obtained by using the 
methods of Chapter 4, that is, by fitting full and reduced models. 

Factorial experiments are most useful in exploratory work where the researcher is 
interested in investigating the effects of possibly a large number of factors over a certain 
range of levels and to find out whether the factors act additively, that is, independently, 
or whether they exhibit interaction. It is the broad picture here that is of primary interest 
and the researcher will have to use his subject-matter knowledge to select the treatment 
factors and determine their levels to be included in the experiment. Once a broad 
picture has been obtained then a more detailed study of factors judged to be important 
may be appropriate as a follow-up. 

11.3 EXPERIMENTS WITH FACTORS AT 
TWO LEVELS 

One of the disadvantages, from a practical point of view, of factorial experiments is 
the fact that the number of treatment combinations increases rapidly as the number 
of factors and/or levels increases. One way out of this dilemma is to consider only 
a subset of all possible treatment combinations, a so-called fractional factorial (see 
Section 11.5 and Chapters 11.13, 14). Another possibility is to consider a reasonable 
number of factors and restrict for each factor the number of levels to 2. Those two 
levels may be chosen so that they cover, in some sense, the practical range of levels ， 
whereas for other factors they represent the only possible levels. Suppose we have n 
such factors. Then we refer to this experiment as a 2 n factorial. 

Although a 2 n factorial is commonly used we should emphasize that it is most 
useful as an exploratory experiment. This is particularly true if a factor admits more 
than two levels. If we restrict ourselves to two levels only, then we cannot examine 
the nature of the main effects and interactions in any detail, for example, in the case 
of quantitative factors we cannot examine trends other than linear, thus our earlier 
recommendation of follow-up studies of a smaller nature. 


11.3.1 Definition of Main Effects and Interactions 

We shall now consider briefly the definition of effects and interactions for a 2 n factorial 
as well as the estimation and testing of such effects. To keep the notation simple we 
shall illustrate the concepts for the special case n = 3. Extension to the general case 
should then be obvious. 

Let us denote the three factors by A, B, C, and their levels by ao, ai ， 6 o ， , Q}, Ci ， 
respectively. The eight treatment combinations can then be written (in standard or- 



11.3. EXPERIMENTS WITH FACTORS AT TWO LEVELS 


423 


der) as 


cioboCo 
ciiboco 
ciobico 
dibiCo 
ctoboCi 
a 160 ci 
a 。 61 ci 
d\b\C\ . 

It is convenient to use the same notation also for the true response of those treatment 
combinations (from the context it should always be clear what is meant). We then 
define the following simple effects of A, denoted by A(bj,Ck) 9 as the effect of factor 
A when changing A from level ao to level a 1 with factor B at level bj and factor C at 
level c/c : 

A{bQ. Co) = aiboCo — ao&oQ) 

A(bi,c 0 ) = aibico - a 0 6ic 0 
A(6o ， c i) = ^i^o c i ~ dobo^i 
A(bi,ci) = aibiCi - a 0 biCi ， 

Using the definitions (11.3) we define the main effect A as 

A = \ y^A(bj,c k ) 
j,k 

— 4 I 〉 : a^bjCk - 〉 ： a 。 6 j 

\j，k 3^ 

that is, A represents the average change in response when a 0 is changed to Gi. Sym¬ 
bolically, we express (11.4) as 

义 = i( a i 一 a o)(^i + 知 ） (Ci + Co ) ， （ 11.5) 

where this expression is meaningful only when the right-hand side is multiplied out 
formally and the terms in that expression are interpreted as the true responses from the 
respective treatment combinations. 

We can also define the effect of A when B is kept at level bj and C is averaged 
over levels Co and c\ as 

A(bj,c) = l[A(bj,c 0 ) + Aibj,^)} 

_ k k . 



(11.3) 


(11.4) 


or, symbolically. 


A(bj,c) = \{ai - a 0 )bj{ci +c 0 ) 


( 11 . 6 ) 
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for j = 0,1. To the extent that A( 6 q* c) and A(b\,c) are different from each other, half 
the difference between them is defined as the interaction between the factors A and B, 
denoted by A x 5 or simply AB, that is, 

AB = ^[A(b u c) - A(b 0 ,c)} (11.7) 

or symbolically, 

■AB = \(cli — ao)( 6 i — bo)(ci + Co). (11.8) 

The factor i in (11.7) is merely a convention so that the denominator in (11.8) repre¬ 
sents, as in (11.5), the number of simple differences among treatment responses. 

Note that AB as defined in (11.8) is the interaction between factors A and B aver¬ 
aged over the levels of factor C. We could, similar to (11.3) consider simple interac¬ 
tions between A and B, defined as 

AB(cq) = Co) -- A( 6 o.. Co)] 

— \{ a i ~ ^o)(^i — ^o)^o 
AB(c\) = |[A( 6 i ， Ci) — A(bo. ci)] 

= — ao)(bi — 6o)ci. 

The difference (apart from the factor \) between these two interactions is a measure of 
the three-factor interaction A x B x C,ov simply ABC, that is, 

ABC = ^[AB{d) - AB(cq)} 

= — ao)(bi — bo)(ci — Co). (H.9) 

In a similar manner we can also define the main effects B and C, and the interac¬ 
tions AC and BC. 

The reader can verify easily that each main effect and interaction represents a con¬ 
trast among the treatment combinations and that these contrasts are orthogonal to each 
other. Apart from the factor 1/4 the contrast coefficients are as given in Table 11.1. The 
reader will notice also that, for example, the coefficients for AB are the products of the 
corresponding coefficients for A and B, and so forth. 

The general rule for writing down expressions like (11.5) ，（ 11.8) ，（ 11.9) and hence 
defining the main effects and interactions for the general 2 n factorial is as follows. Any 
effect or interaction X say, can be represented as 

X 二 士 ao)(&i ± bo)(ci =b c 0 ){di 士 d 0 ) … （11 .10) 

where the sign in each bracket is positive if the corresponding capital letter is not con¬ 
tained in X and negative if it is contained in X, and the whole expression on the 
right-hand side of ( 11 . 10 ) is to be expanded algebraically and interpreted in terms of 
treatment combination responses. Just as illustrated in Table 11.1 for the 2 3 case, the 
main effects and interactions represent here, too, a set of 2 n — 1 orthogonal contrasts. 
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11.3.2 Estimation of Main Effects and Interactions 


To estimate the main effects and interactions we first estimate the treatment effects 
from the error-reduction design used. In the case of orthogonal designs these are, of 
course, simply the treatment means, and this is the only case we shall discuss here. Let 
us denote for n = 3, the treatment mean for the treatment combination (aibjCk) by 
y(aibjCk). In the expressions defining the main effects and interactions, such as (11.4) 
or (11.5), (11.8), (11.9)，we then replace the true responses by the estimated responses, 
that is, the treatment means based on say r observations, where r is the number of 
replications in a CRD, r = bis the number of blocks in a RCBD, r = t is the number 
of rows and columns in a LSD, etc. We then obtain, for example, 


A = \[y(aib 0 co) + y{aibiCo) + y{aib 0 ci) + y{aibid) 
— Vi^oboCo) — y(aob\Co) — y{a^b^c\) — y(aobici)] 


( 11 . 11 ) 


Assuming unit-treatment additivity in the broad sense and using the arguments as 
exposited in previous chapters we obtain immediately 


var 


(i) 


16 


16 


^var(y(ai6jC fe )) + ^ var(5(ao6jCfc)) 
j，k j,k _ 


2 r 




( 11 . 12 ) 


The other main effects and interactions are estimated analogously and each is estimated 
with variance given by (11.12). 

These results are extended easily to the general case of n factors. Using (11.10) we 
then find 




1 


j [sum of 2 n_1 treatment means — sum of remaining 


Table 11.1 Contrast Coefficients for Main Effects and Interactions in 2 3 
Factorial 


Main effect/ 

Interaction ao^oco aibnco anbicn aibicn an 6 nci aibnci an^ici ai 6 ici 


T — < T — * 1 — I 1 — i r~H t—H 
++ + + + + + 

IX IX Tx lx IX IX IX 

IX IX IX IX IX IX lx 

+ I 一+ + -- 

IX 1 —- IX Tx IX Ti Tx 

_ 1 + + I - + 

一 — I i-H 1± t—H Tx 1i 1i 

+ + + I I - I 

1± i — II i —- 1A 

-+ I I + - + 

1- IX IX IX i —- 1 1 

+ 1 I - I + + 

Tx i — ^ Tx t — I 一 -H i — i 

I - + -++_ 


c 


tl >> B ' c c 

^BACABA 


treatment means 


(11.13) 
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and consequently. 


wax(X )= 



r2 n-2 


(11.14) 


11.3.3 Sums of Squares for Main Effects and Interactions 


Since each main effect and interaction represents a contrast among treatments, it is easy 
to obtain for the AN OVA the sum of squares associated with that contrast, say SS(X). 
We know (see Chapter 7, equation (7.4)) that 


SS(X) 


[舒 


var(X)/cr2 
= r2 n ~ 2 [X} 2 

using (11.14). Each SS(X) has 1 d.f. and 

E[SS(X)} =g+r2 n - 2 [X] 2 . 


(11.15) 

(11.16) 


It is, of course, obvious then how the hypothesis Hq ： X = 0 can be tested in the 
ANOVA. * 

The right-hand side of (11.16) also shows that if X is assumed to be negligible then 
SS(X) may be pooled with SS(E) to provide additional d.f. for error. 


11.4 INTERPRETATION OF EFFECTS AND 
INTERACTIONS 

The interpretation of effects and interactions follows closely from the definitions given 
in Section 11.3. For example, for the 2 3 factorial with factors A, B, and C, the main 
effect A is the effect of increasing factor A from the amount ao to the amount ai, 
averaging over all possible level combinations of factors B and C. 

Now suppose we wish to obtain the effect of factor A, averaging over the low and 
high levels of factor B, that is, bo, 6i, but with factor C at the low level, that is, at level 
co. Similar to (11.6) this effect is defined as 

A(b : Co) = \{pj\ — o,o)(bi + bo)co = ^[ctibico + aiboCo — ao&iCo — ao^o c o]* (11.17) 
From the definition of the main effect A, that is, 

A — |(a.i — cio){bi + bo)(c\ + Co) ( 11 . 18 ) 

and the interaction AC, that is, 

AC = l(a.i - a.o)(6i + bo)(ci — cq) (11.19) 

it follows easily, by treating the right-hand sides of (11.18) and (11.19) as algebraic 
quantities, that A{b, c 。） of (11.17) can be expressed alternatively as 

A(b,c 0 ) = A-AC. (11.20) 
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In a similar way we obtain the following: 


A{b,ci)=A + AC 
A(b 0 ,c) = A-AB 
A(b u c) = A + AB 


and for the simple effects defined in (11.3) 


A(b 0 , c 0 ) = A-AB - ACABC 
Alb^co) = A + AB-AC - ABC 
A(6 0 , d) = A-AB-^AC- ABC 
A^.ci) = A + AB + AC + ABC. 


Algebraically, the expressions above can be written as 

A(b. co) = ^4(1 — C) 

A(L Cl ) = A(1^C) 

A(bo,c) = A(1 - B) 

A{b u c)=A(l + B) 

A(b^c 0 ) = A{\-B){l-C) 

A(b u c 0 ) = A(l + B)(l-C) 

A(6 0 , Cl ) = A(1 — B)(l + C) 

A(6i,ci) = A(l + B)(l + C). 

This gives an easy way of remembering them and of writing down the effect of any 
factor for any situation with regard to the other factors. 

Another consequence of having expressions like (11.20) and (11.21) is that we 
can obtain easily estimates of these effects and the variances of these estimates. For 
example, we have 

A(b. Co) = A — AC. 

Since A and AC are orthogonal contrasts among the treatment effects it follows that A 
and AC are uncorrelated and hence 

var[^4(6, co)] = var(^4) + var(AC) 

1 2 
r. 2 3 二 2 6 


A(bo, co) = A — AB — AC ABC 

A 12 
var[i(6 0 ,c 0 )] = 


and, similarly, 
with 
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11.5 INTERACTIONS: A CASE STUDY 

We have mentioned earlier that every treatment design needs to be imbedded in error- 
control design. We have also discussed the possibility of interaction between treatment 
and blocking factors (see Section 9.6.7)，that is, between factors from the set X and the 
sets Z and U (see Section 2.2.4). In this chapter we are considering the situation where 
the set X consists of several factors，say DC = {A l5 A 2 , •. .， 乂 n }. These factors give 
rise to main effects 乂 1 ， A 2 , ..., A n and interactions of the form Aj x Aj(i. j = 1 , 
2, ..n; i 7 ^ j), Ai x Aj x Ak(i ， j., k = 1 , 2, ..n; z, j, k not equal, and so 
on. As a consequence, we now can also envision interactions of the form Ai x Z, 
Ai x U, Ai x Aj x Z, A{ x Aj x U. This list can be extended, of course, but as we 
have mentioned earlier, typically higher order interactions are negligible, or negligible 
from a practical point of view. Rather than discuss these possibilities in generalities we 
shall consider a particular experiment and point out some strategies for exploring the 
existence of the types of interactions mentioned above. 

11.5.1 The Experiment 

The following experiment was discussed by Pearce (1953, 1983) (see also Hinkelmann, 
2004), but for the purpose of this discussion we have made slight modifications and 
have constructed the data (yield) based on summary data given in the article. 

Example 11.1: The objective is to study the effect of different pruning methods on 
the yield of varieties of pears. There are two treatment factors: Ai 三 A = type of 
pruning, A 2 三 B = amount of pruning, each with two levels. For factor A the two 
levels are: F = pruning with few leaders, M = pruning with many leaders, and for 
factor B the two levels are: H = hard pruning, L = light pruning. In order to broaden 
the scope of the study, the investigator included five varieties of pears: Am=Beurre 
d’Amanlis, Ha=Beurre Hardy, Co=Conference, Fe 二 Fertility, Pi=Pitmaston. These 
constitute the five levels of the intrinsic factor z 1 = V = variety. The experiment was 
set up as a randomized complete block design with six blocks for each variety (see 
Figure ??). Thus there is one non-specific factor u\ = j3 = block with six levels (the 
original experiment had eight blocks for each variety). 

Thus, in summary, the experiment is a 2 2 factorial experiment with treatments (F, 
H), (Af, H), (F, L), (M, L) in a randomized complete block design with a nested 
blocking structure 8{V) with 5 x 6 = 30 blocks of size four each. The four treat¬ 
ments were randomly assigned to four experimental units (trees) in each block (see an 
example in Figure 11.1). 

11.5.2 The Model 

Denoting the response to the treatment by y, we can write out a linear model analo¬ 
gous to ( 2 . 2 ) reflecting the treatment and block structures and the type of interactions 
mentioned above, as follows: 
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Block 


Am 


Ha 


Co 


Fe 


Pi 


3 4 5 6 


(L.F) 

(H. M) 
(H,F) 
(L,M) 

(R M) 
(L. M) 
(L*F) 
(H.F) 







_ 



























Figure 11.1 Experimental Layout (Schematic). 


Uijki = ^ Vi Pij + Ak Bi -(AB)ki 
+ (VA) ik + {VB)u + (VAB) ikl 

+ (pA)ijk 4 - (dB)iji + ((3AB)ijki + ^ijku ( 11 . 22 ) 


where 


Vi = effect of i-th variety (i = l, 2, ..., 5) 

Sij = effect of j-th block for i-th variety (j = 1. 2, ..., 5) 
Ak = effect of fc-th type of pruning (k = 1(F), 2(M) 
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Bi = effect of -th amount of pruning (/ = 
(AB)ki = A x B interaction component 

(VA) ik = A x Z interaction component 

(VB) n = B x Z interaction component 
(VAB)iki = A x B x Z interaction component 

(0A)ijk = A xU interaction component 
= B xU interaction component 
(/3AB)ijki = A x B x U interaction component 


刺， 2( L )) 


Based on model (11.22) we can partition the total number of degrees of freedom, 
119 = 120 — 1， in the ANOVA table as given in Table 11.2. We note here that the 
effect terms contained in model (11.22) account for all the d.f., leaving no d.f. for 
error. We have done this on purpose and, in fact, encourage the reader to always write 
out what we might call a full model, that is, accounting for all possible effects and 
interactions and their associated d.f. This will provide a check whether in particular 
we have accounted for all interactions and what, if any, assumptions we need to make 
to obtain an adequate number of d.f. for error (in addition to possibly existing d.f. for 
pure error, such as exist for example in the GRBD (see Section 9.7). 


11.5.3 The Analysis 

We now consider the analysis of the data for the experiment described above. The data 
are given in Table 11.3. (The reader may notice that we have included a factor C, the 
meaning of which will be made clear in comment (iv) regarding Table 11.5). 

Based on model (11.22) and the breakdown of the total d.f. we assume for the 
preliminary analysis that the interaction AB(3 is negligible and hence used as the error 
term. We note that the d.f. associated with A x B x 0 represent only part of the block 
x treatment interaction d.f. 

The analysis is performed using SAS PROC GLM. The input statements and the 
output are given in Table 11.4. We comment on the results as follows: 

(i) The _BxBlock(V) interaction is clearly non-significant (P = .29). 

(ii) The AxBlock(V) interaction is most likely also negligible (P = .16). 

(iii) Based on the results in (i) and (ii) we may thus pool both interaction terms with 
the Ax B x d “interaction” to form the error term with 75 d.f. for future analysis 
purposes. 


(iv) Our new model then becomes 
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Table 11.2 ANOVA for Model (11.22) 


Source of Variation 

Degrees of Freedom 

V 

4 

p 

25 = 5(6 - 1) 

A 

1 

B 

1 

AxB 

1 

V xA 

4 

V x B 

4 

V x AxB 

4 

0xA 

25 

PxB 

25 

p x Ax B 

25 

Total 

119 


Uijkl = fJ^ + Vi (3ij - Ak -{■ Bi -\r {AB)k 

+ {VA) ik + (VB) U + (VAB) ikl + e ijkl .. (11.23) 

The SAS input statements for the ANOVA using model (11.23) and for some 
follow-up procedures are given in Table 11.5. Among these are the slice options 
4 LSMEANS A * B/SLICE = B SLICE = and TSMEANS A * V A * B * 
V7SLICE = V\ With regard to the A * 5 interaction, the slice option tests whether 
the simple effects for A and B, respectively, are significant. In general, the slice option 
tests the equality of the LS means for one factor at the different levels of the other fac¬ 
tor. With regard to the K * ^4 and V * B interactions, the option ‘SLICE 二 F’ enables 
us to test whether the simple effects of A and B are significant for each level of V. We 
note that we did include the option ‘LSMEANS A* B * F/SLICE = V only to show 
that this would result in testing whether the four LS means for (F ， H )， (F, L), (M, H), 
and (M, L) are different from each other for every level of V, and that is of no interest 
to us. 

We now turn to the analysis as presented in Table 11.5 and make the following 
comments: 


(i) The P-values for V and Block(F) should be ignored, since under randomization 
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Table 11.3 A Case Study (Data) 


data pruning; 

input V$ Block A$ BSC Y@@; 
datalines; 


Am 

1 

F 

H 

1 

530 

Am 

1 

F 

L 

2 

581 

Am 

1 

M 

H 

3 

548 

Am 

1 

M 

L 

4 

572 

Am 

2 

F 

H 

1 

523 

Am 

2 

F 

L 

2 

570 

Am 

2 

M 

H 

3 

532 

Am 

2 

M 

L 

4 

571 

Am 

3 

F 

H 

1 

528 

Am 

3 

F 

L 

2 

586 

Am 

3 

M 

H 

3 

539 

Am 

3 

M 

L 

4 

608 

Am 

4 

F 

H 

1 

516 

Am 

4 

F 

L 

2 

604 

Am 

4 

M 

H 

3 

553 

Am 

4 

M 

L 

4 

587 

Am 

5 

F 

H 

1 

558 

Am 

5 

F 

L 

2 

639 

Am 

5 

M 

H 

3 

563 

Am 

5 

M 

L 

4 

615 

Am 

6 

F 

H 

1 

582 

Am 

6 

F 

L 

2 

657 

Am 

6 

M 

H 

3 

580 

Am 

6 

M 

L 

4 

640 

Ha 

1 

F 

H 

1 

534 

Ha 

1 

F 

L 

2 

582 

Ha 

1 

M 

H 

3 

554 

Ha 

1 

M 

L 

4 

619 

Ha 

2 

F 

K 

1 

538 

Ha 

2 

F 

L 

2 

578 

Ha 

2 

M 

H 

3 

543 

Ha 

2 

M 

L 

4 

602 

Ha 

3 

F 

H 

1 

563 

Ha 

3 

F 

L 

2 

599 

Ha 

3 

M 

H 

3 

567 

Ha 

3 

M 

L 

4 

618 

Ha 

4 

F 

H 

1 

567 

Ha 

4 

F 

L 

2 

601 

Ha 

4 

M 

H 

3 

601 

Ha 

4 

M 

L 

4 

629 

Ha 

5 

p 

H 

1 

547 

Ha 

5 

F 

L 

2 

600 

Ha 

5 

M 

H 

3 

607 

Ha 

5 

M 

L 

4 

655 

Ha 

6 

F 

H 

1 

582 

Ha 

6 

F 

L 

2 

636 

Ha 

6 

M 

H 

3 

602 

Ha 

6 

M 

L 

4 

677 

Co 

1 

F 

H 

1 

551 

Co 

1 

F 

L 

2 

604 

Co 

1 

M 

H 

3 

572 

Co 

1 

M 

L 

4 

644 

Co 

2 

F 

H 

1 

545 

Co 

2 

F 

L 

2 

591 

Co 

2 

M 

H 

3 

584 

Co 

2 

M 

L 

4 

647 

Co 

3 

F 

H 

1 

558 

Co 

3 

F 

L 

2 

600 

Co 

3 

M 

H 

3 

587 

Co 

3 

M 

L 

4 

642 

Co 

4 

p 

H 


569 

Co 

4 

F 

L 

2 

614 

Co 

4 

M 

H 

3 

597 

Co 

4 

M 

L 

4 

665 

Co 

5 

F 

H 

1 

598 

Co 

5 

F 

L 

2 

648 

Co 

5 

M 

H 

3 

618 

Co 

5 

M 

L 

4 

660 

Co 

6 

F 

H 


612 

Co 

6 

F 

L 

2 

651 

Co 

6 

M 

H 

3 

638 

Co 

6 

M 

L 

4 

699 

Fe 

1 

F 

H 

1 

575 

Fe 

1 

F 

L 

2 

610 

Fe 

1 

M 

H 

3 

590 

Fe 

1 

M 

L 

4 

655 

Fe 

2 

F 

H 

1 

554 

Fe 

2 

F 

L 

2 

630 

Fe 

2 

M 

H 

3 

605 

Fe 

2 

M 

L 

4 

638 

Fe 

3 

F 

H 

1 

576 

Fe 

3 

F 

L 

2 

648 

Fe 

3 

M 

H 

3 

608 

Fe 

3 

M 

L 

4 

643 

Fe 

4 

F 

H 

1 

595 

Fe 

4 

F 

L 

2 

653 

Fe 

4 

M 

H 

3 

631 

Fe 

4 

M 

L 

4 

656 

Fe 

5 

F 

H 

1 

609 

Fe 

5 

F 

L 

2 

652 

Fe 

5 

M 

H 

3 

641 

Fe 

5 

M 

L 

4 

686 

Fe 

6 

F 

H 

1 

597 

Fe 

6 

F 

L 

2 

652 

Fe 

6 

M 

H 

3 

660 

Fe 

6 

M 

L 

4 

689 

Pi 

1 

F 

H 

1 

600 

Pi 

1 

F 

L 

2 

661 

Pi 

1 

M 

H 

3 

625 

Pi 

1 

M 

L 

4 

702 

Pi 

2 

F 

H 

1 

606 

Pi 

2 

F 

L 

2 

641 

Pi 

2 

M 

H 

3 

635 

Pi 

2 

M 

L 

4 

675 

Pi 

3 

F 

H 

1 

610 

Pi 

3 

F 

L 

2 

643 

Pi 

3 

M 

H 

3 

642 

Pi 

3 

M 

L 

4 

670 

Pi 

4 

F 

H 

1 

609 

Pi 

4 

F 

L 

2 

672 

Pi 

4 

M 

H 

3 

653 

Pi 

4 

M 

L 

4 

684 

Pi 

5 

F 

H 

1 

632 

Pi 

5 

F 

L 

2 

694 

Pi 

5 

M 

H 

3 

669 

Pi 

5 

M 

L 

4 

723 

Pi 

6 

F 

H 

1 

655 

Pi 

6 

F 

L 

2 

714 

Pi 

6 

M 

H 

3 

676 

Pi 

6 

M 

L 

4 

727 


run; 




Table 11.4 A Case Study (Preliminary ANOVA) 


a) Input statements: 


proc glm data=pruning; 
class V Block ABC; 

model Y=V Block(V) A B A*B V*A V*B V*A*B A*Block(V) B*Block(V); 

run; 


b) Output: 


The GLM Procedure 


Class Level Information 


Class 

V 

Block 

A 

3 

C 


Levels 

5 

6 

2 

4 


Values 

Am Co Fe Ha Pi 
1 2 3 4 5 6 
F M 
H L 

12 3 4 


Number of Observations Read 120 

Number of Observations Used 120 


Dependent Variable : Y 


Source 

DF 

Sum of 
Squares 

Mean Square 

F Value 

Model 

94 

261663.3833 

2783.6530 

30.58 

Error 

25 

2275.4167 

91.C167 


Corrected Total 

119 

263938.800C 




R-Square 

Coeff Var 

Root M5E 

Y Mean 

0.991379 

1.556578 

9.540266 

612.S0C0 


Source 

DF 

Type III SS 

Mean Square 

F Value 

V 

4 

102743.C50C 

25685.7625 

282 

.21 

Block(V) 

25 

5C015.25C0 

20C0.7700 

21 

• 98 

A 

1 

18451,2000 

18451.2000 

2C2 

. 72 

B 

1 

78540.8333 

7854C.8333 

862 

• 93 

A*E 

1 

108.3000 

108.30C0 

1 

.19 

V»A 

4 

3750.7167 

937.6792 

10 

• 30 

V*B 

4 

306.5833 

76.6458 

0 

.84 

V*A*B 

L 

1494.7833 

373.6958 

4 

.11 

Block*A(V) 

25 

3415.5833 

136.6233 

i 

.50 

BlockxB(V) 

25 

2833.C833 

113.3233 

1 

.25 


Pr > F 

<.0001 


?r > F 

< . 0001 
<.0001 
<.0001 
<.0001 
0.2858 
<.0001 
0.5117 
0.0108 
0.1582 
C .2939 
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Table 11.5 A Case Study (ANOVA and Post-hoc Analysis) 


a) Input statements: 

proc glm data=pruning; 
class V Block A B; 

model Y=V Block(V) A B A*B A*V B*V A*B*V/SS3; 

lsmeans A B A*B/slice=B slice=A; 

estimate ’Main effect A’ A -1 1; 

estimate ’Main effect B’ B -1 1; 

lsmeans V A*V B*V A*B*V/slice=V; 

run; 

b) Output: 


The GLM Procedure 

Dependent Variable : Y 


Source 


DF 

Sum of 

Squares Mean Square 

F 

Value 

Model 


44 

255414.7167 

5804.8799 


51.07 

Error 


75 

8524.0833 

113.6544 



Corrected 

Total 

119 

263938.8000 





R-Square 

Coeff Var Root MSE 

Y 

Mean 



0.967704 


1.739417 10.66088 

612 . 

,90C0 



Source 

DF 

Type III SS 

Mean Square 

F Value 

V 

4 

102743.0500 

25685.7625 

226.00 

Block(V) 

25 

50019.2500 

2000.7700 

17.60 

A 

1 

18451.2000 

18451.2000 

162.34 

a 

1 

78540.8333 

78540.8333 

631.05 

A*B 

1 

108.3000 

108.3000 

0.95 

V*A 

4 

3750.7167 

937.6792 

8.25 

V*B 

4 

306.5833 

76.6458 

0.67 

V*A*E 

4 

1494.7833 

373.6958 

3.29 


Least Squares Means 


A Y LSMEAN 

F 


Pr > F 

<.0001 


Pr > F 

<.0001 
< . 0001 
<.0001 
<•00C1 
0.3321 
<.0001 
0.6118 
0.0154 


M 


600.500000 

625.300000 



Table 11.5 (Continued) 


Am 

1 

48 

Co 

1 

7072 

Fe 

1 

5133 

Ha 

1 

5017 

Pi 

1 

4930 


166667 48.166667 
666667 7072.666667 
375000 5133.375000 
041667 5017.041667 
666667 4330.666667 


42 0.5170 
23 <.0C01 
17 <.000 ： 
14 <.0001 
38 <.0001 


Least Squares Means 

B Y LSMEAN 

H 587.316667 

L 638.483333 


A B Y LSMEAN 

F H 573.966667 
F L 627.033333 
M H 600.666667 
M L 649.933333 


A*B Effect Sliced by B for Y 
Sum of 

B DF Squares Mean Square F Value Pr > F 

H 1 10693 10693 94.09 <.0001 

L 1 7866.150000 7866.150000 69.21 <.0001 


A*3 Effect Sliced by A for Y 
Sum of 

A DF Squares Mean Square F Value Pr > F 

F 1 42241 42241 371.66 <.0001 

M 1 36408 36408 320.34 <.0001 


V Y LSMEAN 

Am 574.250000 
Co 612.25000(3 
Fe 627.208333 
Ha 591.708333 
Pi 659.083333 


V A Y LSMEAN 

Am F 572.833333 

Am M 575.666667 

Co F 595.083333 

Co M 629.416667 

Fe F 612.583333 

Fe M 641.833333 

Ha F 577.250000 

Ha M 606.166667 

Pi F 644.750000 

Pi M 673.416667 

V*A Effect Sliced by V for Y 

Sum of 

V DF Squares Mean Square F Value Pr > F 




Table 11.5 (Continued) 


6607.277778 

8078.277778 
6398.486111 
6575.152778 

6558.277778 


13 <.0C01 
08 <.0001 
30 <.OCOl 
85 <.C001 
70 <.0001 


standard 

Error t Value Pr > 


Am 

3 

19822 

Co 

3 

24235 

Fe 

3 

19195 

Ha 

3 

19725 

Pi 

3 

19675 


Dependent Variable : Y 

Parameter Estimate 


V 

E 

Y LSMEAN 

Am 

H 

546.000000 

Am 

L 

602.500000 

Co 

H 

585.7500C0 

Co 

L 

638.75C000 

Fe 

H 

603.416667 

Fe 

L 

65I.OOOOCO 

Ha 

H 

567.083333 

Ha 

L 

616.333333 

?i 

H 

634.333333 

Pi 

L 

683.833333 


V*3 Sffecu Sliced by V for Y 


V 

DF 

Suit, of 
Squares 

Mean Square 

F Value 

Pr > F 

Am 

1 

19154 

19154 

168.52 

<.0001 

Co 

1 

16854 

16854 

148.29 

<.00C1 

Fe 

1 

13585 

13585 

119.53 

<.C001 

Ha 

i 

14553 

14553 

128.05 

<.0001 

Pi 

1 

14*702 

14702 

129.35 

<.0001 


V A B Y LSMEAN 

Am F H 539.500000 
Am F L 606.166667 
Am M H 552.50000C 
Am M L 598.833333 
Co ? H 572.166667 
Co F L 618.0C000C 
Co M H 599.333333 
Co M I 659.500000 
Fe F H 584.333333 
?e F L 64C.833333 
Fe M H 622.500000 
Fe M L 661.166667 
Ha F H 555.166667 
Ha F L 599.333333 
Ha M H 579.000000 
Ha M L 633.333333 
Pi F H 618.666667 
Pi F L 670.833333 
Pi M H 650.000C00 
Pi M L 696.833333 


V*A*3 Effeci Sliced by V for Y 
Sum of 

V DF Squares Mean Square F Value Pr > F 


Main effect A 
Main effect 3 


24.8000000 
51.1666667 


1.9464C2I9 
1.94640219 


12.74 

26.29 


<.0001 

<.0001 
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theory no significance tests for block effects, that is, effects of intrinsic and non- 
specific factors, are permissible. 

(ii) The A 氺 B interaction is non-significant (P = .33). Thus there is no real need to 
invoke the slice option. We have included it, however, for purposes of illustra¬ 
tion. Using the A * B LS means and the definition of simple effects in Section 
11.3.1 we find the estimates of the simple effects to be 


and 


A{H) = 600.67 - 573.97 = 
A{L) = 649.93 - 627.03 = 

B(F) = 627.03 — 573.97 = 
B{M) = 649.93 - 600.67 = 


26.7 

22.9 

53.06 

49.26 


and all are significantly different from zero (P < .0001) and so are the estimates 
of the main effects A = 24.8 and B = 51.17 with standard error 1.95. At the 
same time, the test for ^4 * 召 interaction indicates that the simple effects for A 
as well as those for B are not different from each other, leading to near-parallel 
lines in the interaction plot. 

(iii) The interactions A^V and A * 5 * y are significant (P < .0001) and P = 
.015, respectively). The results for the A * y interaction sliced by V indicate 
that only the simple A-effects for variety Am are not different from each other, 
whereas the estimates of the simple A-effects for the other four varieties are of 
the same order of magnitude, around 30, as can be seen from the A^V 乙 S means. 
This interaction is clearly a codirectional interaction, and hence considering the 
overall A main effect is appropriate. Furthermore, the slice operation shows also 
that the A x F interaction comes about only because of the different behaviour 
of variety Am. If we were to drop Am from the analysis, there would be no 
AxV interaction. Since the B xV interaction is not significant, the B^V slice 
operation is not really needed. The B LS means show that the estimators of 
the simple S-effects are all about the same order of magnitude, around 50. 

(iv) Concerning the A x B x V interaction, we have included the slice operation 
with A^B^V effect sliced by V only to demonstrate that this operator does 
not produce the desired results for three-factor interactions. We would like to 
compare the simple A x B interactions (as defined in Section 11.3.1), but the 
results of the slice operation indicate that this procedure tests, for each variety, 
the equality to the four B LS means, as indicated by DF = 3. To achieve 
our objective we now use the factor C introduced earlier, noting that the contrast 
vector (1, -1, —1 ， 1) for C describes the A x B interaction. More specifically, 
we use the SAS input statement and the results in Table 11.6 as follows. 

We note that from Tables 11.5 and 11.6 it follows that the following relationships 
among sums of squares exist: 

SS(C) = SS(A) + SS(B) + SS(A * B) 
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<. 0001 
<.0001 
<.0001 
<.0001 


V 

4 

1C2743.0500 

25685.7625 

Block(V) 

25 

5C019.2500 

2000.7700 

C 

v*c 

3 

97100.3333 

32366.7778 

12 

5552.0833 

462.6736 


Least Squares Means 

VC Y LSMEAN 

Am 1 539.500000 

Am 2 606.166667 

Am 3 552.500000 

Am 4 598.833333 

Co 1 572.166667 

Co 2 618.C00000 

Co 3 599.333333 

Co 4 659.500COO 

Fe 1 584.333333 


Table 11.6 A Case Study (Additional Post-hoc Analysis) 


a) Input statements: 

proc glm data=pruning; 
class V Block C; 

model Y=V Block(V) C V*C/SS3; 
lsmeans V*C; 

estimate ’A*B for Am’ C 1 -1 -1 1 V*C I -i -1 l/divisor=2; 
estimate ， A*B for Co’ C 1 -1 -1 1 V*C0000 1-1-1 l/divisor=2; 
estimate’A*B for Fe 1 C 1 -1 -1 1 V*C 0 0 0 0 0 0 0 0 1 -1 -1 
l/divisor=2; 

estimate ’A*B for Ha，C 1 -1 -1 1 V*C 0 0 0 0 0 0 0 0 0 0 0 0 1 -1 -1 
l/divisor=2; 

estimate ’A*B for Pi’ C 1 -1 -1 1 V*C 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 

-1-1 l/divisor=2; 

run; 

b) Output: 


The GLM Procedure 

Dependent Variable : Y 


Sum of 


Source 

DF 

Squares 

Mean Square 

F Value 

Pr > F 

Model 

44 

255414.7167 

5804 _8799 

51.07 

<■0001 

Error 

75 

8524.0833 

113.6544 



Corrected Total 

119 

263938.8000 





R-Square Coeff Var Root MSE Y Mean 

0.967704 1.739417 10.66088 612.9000 

Source DF Type III SS Mean Square F Value Pr > F 


0 0 8 7 
0 6 7 0 

6 7 4 4 
2 18 
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Dependent Variable : Y 

Parameter 


Pr > |tI 

0.0222 
0 _ 1038 
C .0440 
0.2465 
0.5419 


Estimate 

-10.1666667 

7.1666667 

-8.9166667 

5.0833333 

-2.6666667 


Standard 

Error 

4.35228761 
4.35228761 
4.35228761 
4.35228761 
4.35228761 


Table 11.6 (Continued) 


Fe 

2 

640.833333 

Fe 

3 

622.500000 

Fe 

4 

661.166667 

Ha 

1 

555.166667 

Ha 

2 

599.333333 

Ha 

3 

579.000000 

Ha 

4 

633.333333 

Pi 

1 

618•666667 

Pi 

2 

670.833333 

Pi 

3 

650.000000 

Pi 

4 

696.833333 


and 

SS(V * C) = SS(V * A) + SS{V * B) + SS{V * A*S) 

with 3 and 12 d.f., respectively. From SS(C) + SS(V * C) with 15 d.f. the 
estimate statements in Table 11.6a isolate 5 d.f. which specify the simple B 
interactions for each variety. The results in Table 11.6 indicate that only the A^B 
interaction for Am is clearly significant (= .022). The A 木 B interaction for Fe is 
borderline significant (P = .044), whereas the other A^B interactions are not 
significant. This shows again that the variety Am behaves somewhat differently 
than the other varieties. A closer look at the V * A * 5 LS means confirms this 
finding as the highest yield for all varieties except Am is achieved for the (M, 
L) treatment combination. For Am the highest yield is obtained for the (F ， L) 
treatment combination, but the difference between the yields for (M, L) and (F ， 
L) is relatively small. 

(v) Thus, the overall conclusion from this study shows that over a wide range of 
pear varieties the method of light pruning with many leaders will lead to the best 
results. Possible exceptions are for varieties similar to Am, where light pruning 
with few leaders may produce slightly better results. □ 

11.5.4 Separate Analyses 

The follow-up procedures as described above are done within the context of the overall 
analysis using model 11.23. And this is the method we recommend in general. One 



4 5 5 7 1 
3 6 0 16 
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reason for proceeding in this way is that all inferences are based on the same error term, 
namely MS(Error) from the overall ANOVA, usually based on a sufficient number of 
degrees of freedom. 

An alternative procedure, however, might be to perform separate analyses for each 
level of one intrinsic factor or for each level combination of several intrinsic factors. 
Since in our example theV x A x B interaction is significant, we might be led to five 
analyses, each based on the model 

Y^ = ,^+0f + 4 ) + Bl' ) + (AB)^+e% 

for i = 1 ， 2 ， … ， 5. Although we would be able to make then recommendations 
separately for each variety, it becomes more difficult to arrive at statistically sound 
overall conclusions. We shall not provide the details of the five analyses here, but only 
report that the overall result would have been the same as obtained in (v) above. 


11.5.5 Blocking by Intrinsic Factor Only 

The experiment described and analyzed in the preceding sections uses an RCBD with 
a nested blocking structure and a factorial treatment structure. We have used these 
structures to investigate various forms of interactions. Generally speaking, we have 
considered interactions of the form X x X x U, X x U, X x X x Z, X .x Z, X x X. 
The absence of the X x X x U and X x U interactions provided us with an appropriate 
error term to perform the analysis in Section 11.5.3. 


Example 11.2 ： 

In this section we shall consider the situation if the experiment had consisted of just 
one block for each variety. In that case “variety” is the only blocking factor. In other 
words, the intrinsic factor is the only blocking factor. 

The typical approach to analyzing data from such an experiment would be to as¬ 
sume that the treatment x block interaction is negligible and then to use the model 


Yijk = f^t + Vi Aj + Bk + (AB)jk 4 - eijk. (11.24) 


We know, however, from the analysis of he larger experiment that the V x A and 
V x A x B interactions were significant there. Thus, the assumptions that lead to 
model (11.24) may not be appropriate. 

In general, we do not have this kind of insight, but whenever an intrinsic factor is 
used as a blocking factor careful consideration must be given to possible existence of 
X x 2. interactions. Usually such considerations have to be based on subject matter 
knowledge rather than on statistical arguments since there may only exist limited test¬ 
ing for the X x 2, interaction as given by, for example, Tukey (1949) and Mandel (1961) 
(see Section 9.6). 

In our example, the treatments have a factorial structure. Therefore the X x Z 
interaction can be divided into A x V, B x V, and A x B x V interactions. This 
provides us with a choice whether to assume that all three interaction components or 
only one or two of them are negligible. 
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An obvious choice would be to assume that A x B x V is negligible and to use 
SS(A x B x V) 8 lS the error sum of squares, SS(£), with 4 d.f. In our case we happen 
to know, however, that A x B x V may not be negligible and hence we may not be 
willing to follow this route. Instead we shall propose here an ad-hoc approach to this 
potential problem. 

11.5.6 Using the Half-normal Plot Technique 

We shall adapt the method of half-normal plots which was proposed by Daniel (1959) 
to identify non-zero effects in a saturated fraction of a 2 n factorial. Saturated in this 
context means that the design does not provide any d.f. for error. This is the same 
situation here if we are not willing, a priori, to assume that some of the X x Z inter¬ 
actions are negligible. The method essentially consists of plotting the absolute values 
of the estimates of interactions and main effects with increasing magnitude on half¬ 
normal probability paper, and if the values are all zero they should lie on a straight line. 
Estimates with “large” deviations from this line are considered to be non-zero, that 
is, nonnegligible (for a description see Daniel (1959), Zahn (1975), and also Section 
II. 13.9). In order to use this method for our purpose we need to partition the X x Z 
interaction into single>d.f.-contrasts. In general, if X has v x d.f. and 2. has v z d.f., then 
X x Z has v x - v z d.f. Thus, there will be v x - v z contrasts which will have to be or¬ 
thonormal, that is, orthogonal and normalized, for this procedure to work. We shall use 
our example to describe how this can be accomplished. The general procedure should 
then become obvious. 

We have u x = 3 and v z — 4, where the 3 d.f. for X are represented by those for 
A, B, and AB, and the 4 d.f. for Z by four comparisons among the five varieties, 
denoted by VI, V2, V3, V4 say. For VI, V2, V3, V4 we choose the complete set 
of four orthogonal polynomials among the five varieties. The contrast coefficients for 
these orthogonal polynomials are given in Table 11.7 (see Section 7.4). We should 
note that these contrasts have no particular meaning here since the levels of V are 
nominal, but that they were chosen conveniently for mathematical purposes only; other 
contrasts could have been chosen just as well as long as they are orthogonal. The 
seven sets of contrast coefficients are given in Table 11.7, labelled Vl f V2, V3, V4, 
A, B, AB. The coefficients for the 12 contrasts belonging to the X x 2. interactions 
are then simply obtained by multiplying the coefficients for the corresponding X and Z 
contrasts. For example, the coefficients for the contrast VIA is obtained by multiplying 
elementwise the coefficients for V^l and A. For the set of contrasts labelled VIA, V 2A, 
.• •，V A AB we also give the normalizing divisor (ND), which is the square root of the 
sum of the squared coefficients. We then obtain the contrast estimates and plot their 
absolute values on half-normal probability paper. The results, using for each variety 
the observations given in block 5, are given in Figure 11.2. 

Inspection of Figure 11.2 shows that the absolute values for VIA, V2A, V3A, 
V4A, and V2B do not lie on the line going through the smaller contrast values. This 
implies that, at least informally, these contrasts may not be negligible and, hence, prob¬ 
ably should not be included in the error term. 
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Value 

Figure 11.2 Half-normal Plot for V*A*B Single-df Contrasts. 


However, since the contrast V2B represents only one d.f. of the 4 d.f. for the 
interaction B xV, it may be quite appropriate from a practical point of view to declare 
B xV negligible and use the error term 

SS(Error*) = SS(B x V) + SS{AB x V). (11.25) 

The form of the error term (11.25) implies that the data should be analyzed according 
to the model 

Yijk = ^ Vi + Aj -\- Bk (AB)jk + (VA)ij + e*j k . (11.26) 

11.5.7 The Analysis 

The analysis of the data using model 11.26 is presented in Table 11.8. We comment 
briefly on the SAS PROC GLM output: 

(i) The main effects A and B are significant with P < .0001. 

(ii) The interaction A * 5 is not significant (P = .2023). 





asuaujacl 


(iii) The A ^ V interaction is significant (P = .0031) as suggested already by the 
half-normal plot of Figure 11.2. 
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Table 11.8 Blocking by Variety Only 


a) Input statements: 

Data block5; 

inputVBlock ABC Y @@； 
datalines; 


15 6 8 2 
6 6 6 6 7 


m a o e .1 
A H c F p 

3 7 s 1 9 
6 0 14 6 


M MMM M 


m a o e i 
A H c F p 

9 0 8 2 4 
3 0 4 5 9 


-Tla O e i 
A H c F ? 

8 7 8 9 2 
5 4 9 0 3 


n a o e 
A H c F _ 


0.987982 


1.225336 


7.728195 


63C.7000 
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F H 

F L 

M H 

ML 


588.800000 

646.600000 

619.600000 
667.800030 


3.456154 

3.456154 

3.456154 

3.456154 


V 

A 

Y LSMEAN 

Standard 

Error 

Fr > |t | 

Am 

F 

598.50C00C 

5.464659 

<.0001 

Am 

M 

589.000000 

5.464659 

<•0001 

Co 

F 

623.00C000 

5.464659 

<.00C1 

Co 

M 

639.000000 

5.464659 

<.0001 

Fe 

F 

630.500000 

5.464659 

<.0001 

Fe 

M 

663.500000 

5.464659 

<.0002 

Ha 

F 

573.500000 

5.464659 

〈 •0G01 

Ha 

M 

631.000000 

5.464659 

<.0001 

Pi 

F 

663.000000 

5.464659 

<•0001 

Pi 

M 

696.0000C0 

5.464659 

<.0001 


V*A Effect Sliced by V for Y 




Sum of 




V 

DF 

Squares 

Mean Square 

F Value 

Fr > F 

Am 

1 

90.25C000 

SG.25G00C 

1.51 

0.2539 

Co 

1 

256.000000 

256.000000 

4.29 

0.0722 

Fe 

1 

1089.000000 

108S.OCOOOO 

18.23 

0.0027 

Ha 

1 

3306.250000 

3306.250000 

55.3 6 

<.0001 

Pi 

1 

1089.000000 

1089.000000 

18.23 

0.0027 


Table 11.8 (Continued) 


Source 

DF 

Type III SS 

Mean Square 

F Value 

?r > F 

V 

4 

19287.70C00 

4821.92500 

80.74 

<•0001 

A 

1 

3380.00000 

3380.00000 

56.59 

<.0001 

B 

1 

14045.0C00C 

14045.00000 

235.16 

<.0001 

A*3 

1 

115.20000 

115.20000 

1.93 

0.2023 

V*A 

4 

2450.50000 

612.62500 

10.26 

0.0031 


Least Squares Means 


A 

Y LSM2AN 

Standard 

Error 

Pr > 

It 1 

F 

617.700000 

2.44387C 

<■0001 

M 

643.700000 

2.443870 

<.0001 

B 

Y LSMEAN 

Standard 

Error 

Pr > 

1 W I 

H 

604.200000 

2.443870 

<.0001 

L 

657.200000 

2.443870 

<.0CC1 


Standard 

A B Y LSMEAN Error Pr > It! 


I — I i—I 

o o o o 
o o o o 
o o o c 
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(iv) Slicing the A * F interaction indicates that the simple A-effects for variety Am 
are not significantly different from each other. Inspection of th&V ^ A LS means 
shows that the Ax V interaction is codirectional. □ 

11.5.8 Summary 

It is virtually impossible to discuss all the possible ramifications that a complex treat¬ 
ment and blocking structure may have on the data model, assumptions about the terms 
in the model, the ensuing analysis, the implications of existing interactions, and the 
interpretation of the results. The two experiments discussed above serve as examples 
of some strategies we may apply. 

First of all we suggest to write out a model including all possible interactions as 
determined by the treatment and blocking structure of a given design. Based on subject 
matter knowledge or previous results we may then decide to drop certain treatment- 
block factor interactions (we can never drop interactions between blocking factors, 
whether they are real or not), which then become part of the experimental error. In 
doing so we should always withstand the temptation of convenience or the desire to 
obtain additional d.f. for error. The latter may be accomplished through a preliminary 
test in the context of the ANOVA. 

For the remaining interactions we have indicated different approaches, such as 
looking at LS means via interaction plots, using the SLICE operator in SAS PROC 
GLM, considering simple two-factor interactions and sets of orthogonal contrasts. Not 
always is it possible to arrive at a simple answer, especially if the structure is very com¬ 
plicated. The most important point is to stay within the objective of the experiment and 
perhaps formulate new objectives for a follow-up experiment. 


11.6 2 n FACTORIALS IN INCOMPLETE 
BLOCKS 

As mentioned earlier the number of treatment combinations in a factorial experiment 
may be quite large. If an error-reduction design with blocking has to be used we may 
not be in a position to have sufficiently homogeneous blocks large enough to accom¬ 
modate all the treatment combinations. Hence some form of incomplete block design 
is called for. Obviously, any of the incomplete block designs we have described earlier 
can be used. For factorial experiments there exist, however, special methods of con¬ 
structing incomplete block designs based on the assumption or knowledge that certain 
interactions are negligible or of lesser importance. We shall illustrate this for the 2 n 
factorial, specifically for the 2 3 factorial. Generalizations are discussed in detail in 
Chapters II. 8-12. 

11.6.1 2 3 Factorial in Blocks of Size 4 

Suppose we have three factors A, B, C, and we have available blocks of size 4. If we 
assume that the three-factor interaction ABC is negligible we can then arrange the treat¬ 
ment combinations in such a way that ABC becomes nonestimable or, as we say, ABC 
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is confounded with blocks. The idea is to assign those treatment combinations which 
enter positively into ABC to one block and those that enter negatively into another block 
and then replicate this basic arrangement r times. Recall now (or see Table 11.1) that 

ABC = |(ai — ao)(b\ — bo)(ci — Co) 

= \[a\biCi - aibico - aib 0 Ci + aib 0 co - aohci 
+ a 0 bic 0 + a 0 b 0 ci - a 0 b 0 co}. 


Hence, the basic block arrangement is then as follows: 

Block 1 : ai^iCi,aifcoQ)*^o^ico ； Q^o^oCi 
Block2: aifciCo,ao^i c i? a o^oCo- 

Using the familiar model (suppressing subscripts) 

y = fi +,3+ r-\-e 

for each observation, we see immediately that for our arrangement 

E(ABC) = 0i - ,8 2 + ABC. 

This illustrates the phrase that 乂 BC is nonestimable, namely that for this design ABC 
is a biased estimator for ABC, the bias being ,5i — /? 2 , the difference of block effects. 
Thus also the phrase: ABC is confounded with blocks, that is, ABC and j3\ - 02 
cannot be separated. 

We see, however, from arrangement (11.27) that all other main effects and interac¬ 
tions are estimable in the usual way. The reason for that is that for every other effect 
each block contains two treatment combinations which enter positively into the effect 
and two which enter negatively so that the block effects cancel each other. Consider, 


for example, main effect A: 


Block 1: positively: 

aib\Ci, <2i^oCo 

negatively: 

dobiCo, aoboCi 

Block 2: positively: 

a l^0 c l 

negatively: 

ao&ici ， aoboCo- 


Suppose we replicate the arrangement (11.27) r times, that is, we have 2r blocks of 
size 4 altogether. If we denote the block totals by Bi{i = 1,2...., 2r) and the grand 
total by G, we can then write the ANOVA as given in Table 11.9. 

The important point to note here is that because of the confounding of ABC there 
are only 6 d.f. for treatments rather than the usual 7. This design is thus an example of 
what we have called earlier (see Section 9.8) a disconnected design, except that this is 
the result of a deliberate choice on our part. 

11.6.2 2 3 Factorial in Blocks of Size 2 

This method of constructing incomplete block arrangements or, as they are also called, 
systems of confounding, can be used for blocks with size equal to a power of 2 (the 


(11.27) 


(11.28) 
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Table 11.9 ANOVA for 2 3 Factorial in Blocks of Size 4 
and ABC Confounded 


Source 

d.f. 

SS 

E(MS) 

Blocks 

2r- 1 



Treatments 

6 



A 

1 

2r\A} 2 

^+2r[A] 2 

B 

1 

2r[B} 2 

^ + 2r[B] 2 

AB 

1 

2r[AB) 2 

a 2 e + 2r[AB} 2 

C 

1 

2r[C) 2 

<r e 2 +2r[C] 2 

AC 

1 

2r[AC} 2 

^ + 2r[AC] 2 

BC 

1 

2r[BC] 2 

a 2 e + 2r[BCf 

Error 

6(r- 1) 

Difference 


Total 

8r- 1 

T.v 2 -bG 2 



disadvantage, of course, is that it can only be used for blocks with size equal to a 
power of 2). For our example we shall now also consider the situation where we have 
available blocks of size 2. 

The general idea is to first partition the treatment combinations into two sets based 
upon the sign with which they enter into a certain interaction, say ABC as above: 

+ : dibiCi ， a 160^0? o.obiCos o,oboCi 
一： aibiCQ, aiboCi^aobiCi ， ao&o c o 

Then each set above is partitioned again into sets of two based upon the sign with which 
they enter into another main effect or interaction, say BC. We then obtain the following 
partition: 

Sign for 


Block ABC BC Treatment Combination 


1 + + ctib\C\^ CLiboCo 

2 + — a-obiCo, ao&oCi 

3 — + ciobiCi^ ao^o c o 

4 — — dibiCQ, ciib[)Ci 


(11.29) 


These four sets then constitute the basic arrangement in blocks of size 2, and this 
arrangement will be replicated r times, giving us 4r blocks. 
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The way in which arrangement (11.29) was constructed implies that ABC and BC 
are confounded with blocks. By inspection we also see that the main effect A is con¬ 
founded with blocks, because 

E{A) = I (8l — 02 — 03 + /?4) + A 

but all other effects are estimable, that is, not confounded. This is a consequence of 
our method of construction: Since we have four blocks with three d.f. among them, we 
have to confound three main effects or interactions (each with 1 d.f.) with blocks. Two 
of these three effects are chosen independently (ABC and BC in our case). The third 
effect is then determined automatically and can be obtained formally from 

ABC xBC = AB 2 C 2 = A (11.30) 

that is, by formally multiplying the confounded effects into each other and then drop¬ 
ping any letter raised to the second power (for the mathematics behind this see Chap¬ 
ter II.7). We also refer to A in this case as the generalized interaction between ABC and 
BC. The general rule concerning confounding then says: if two effects are confounded 
with blocks, then their generalized interaction is also confounded with blocks. 

11.6.3 Partial Confounding 

It is generally undesirable to confound main effects and two-factor interactions with 
blocks. In our simple (but not unrealistic) example this is unavoidable. We can see this 
by simply listing all possible systems of confounding: 


(i) ABC, AB, ABC x AB = C 

(ii) ABC, AC, ABC x AC = B 

(iii) ABC, BC, ABC x BC = A 

(iv) AB, AC, AB x AC = BC (11.31) 

(v) A,B,A x B = AB 
(\i)A,C,AxC = AC 
(vii) B,C,BxC = BC. 

Clearly, (v), (vi) and (vii) are the most undesirable systems, and only system (iv) 
avoids confounding main effects but at the price of confounding all three 2-factor in¬ 
teractions. To avoid complete loss of information about the effects in one of the sets in 
(11.31) (and thereby obtaining full information about the remaining effects), we may 
use a compromise solution by constructing a design based on several systems of con¬ 
founding dictated largely by the requirements of the experiment. This may result in 
complete loss of information for some effects, partial information (to varying degrees) 
on other effects, and full information on the remaining effects. Such a method is re¬ 
ferred to as partial confounding (as compared to complete confounding as described 
above). We shall now give a simple example. 
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EXAMPLE 11.3: Suppose the requirements of our experiment for the 2 3 factorial are 

(a) We would like as much information about main effects as possible. 

(b) All 2-factor interactions are equally important and we need some information 
about them. 

(c) The 3-factor interaction is most likely negligible. 

(d) We have available 16 blocks of size 2. 

Since each of the systems of confounding listed in (11.31) gives rise to four blocks, one 
possibility then is to use four of those systems, each giving rise to a complete replicate. 
To satisfy the requirements of the experiment, we choose systems (i), (ii), (iii), (iv) and 
label the resulting arrangements Rep. I, Rep. II ， Rep. III ， Rep. IV, respectively, with 
the following confounded and estimable effects: 


Effects 


Replicate 

Confounded 

Estimable 

I 

C, AB, ABC 

A, S, AC, BC 

I 

B ， AC, ABC 

A, C, AB, BC 

III 

A t BC，ABC 

B, C, AB, AC 

IV 

AB, AC，BC 

A ， B ， C,ABC 


Over the whole experiment the amount of information for the individual effects is 
then as follows: 

Effect Amount of Information 


A 

3/4 

B 

3/4 

AB 

1/2 

C 

3/4 

AC 

1/2 

BC 

1/2 

ABC 

1/4 


This seems to be a reasonable arrangement, in that it gives equal information about the 
main effects (3/4) and about the 2-factor interactions (1/2), and it gives some informa¬ 
tion about the 3-factor interaction. 

The actual layout, that is, the assignment of the treatment combinations to the 
blocks can be obtained following the rule given above. The result is presented in Ta¬ 
ble 11.10. ^ 

As discussed above, all effects are estimable only from some of the replicates, 
namely those in which they are not confounded. This can be displayed as follows: 
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Effect 

Estimable from 
Replicates 

Effective # 
of replications (r) 

A 

I ， II， IV 

3 

B 

I, III, IV 

3 

AB 

II， III 

2 

C 

II, III, IV 

3 

AC 

I, III 

2 

BC 

i, ii 

2 

ABC 

IV 

1 


The difference in the effective number of replications implies that the effects are esti¬ 
mated with different precision, that is, in the general formula (11.14) for the variance 
of the estimate of an effect, r now takes on the values given above, that is, 

var(l) = var ( 后 ) =var(C) = 

vai(AB) = vax(AC) = var(^C) = 

2 • 2 

var(ABC) = 

This unequal replication is, of course, also reflected in the ANOVA as given in Ta¬ 
ble 11.11. Here，for example, Auijy indicates that the main effect A is estimated 
from the observations in replicates I, II, IV only. □ 

The examples discussed above are meant to be an introduction to the notion of con¬ 
founding and partial confounding as well as the construction and analysis of appropri¬ 
ate designs. Using these examples the reader should have no difficulty applying these 
ideas to other 2 n factorials in appropriate incomplete blocks. The task will be made 
easier, however, by applying the mathematical tools provided in Chapters II.8 and 9. 
Extensions to other factorial experiments are discussed in detail in Chapters 11.10-12. 
A convenient tool to generate systems of confounding is provided by SAS PROC FAC- 
TEX (SAS Institute, Inc. 2002 — 2003). For an illustration see Examples 11.11 and 
11.12 in Section 11.11. 

11.7 FRACTIONS OF 2 n FACTORIALS 

11.7.1 Rationale for Fractional Replication 

In our discussion so far we have used designs in which all treatment combinations have 
been used the same number of times. This may not always be very practical, in partic¬ 
ular if the number of factors, n, is quite large. And as we have mentioned earlier, 2 n 
factorial experiments are very valuable for exploratory experiments involving a large 
number of factors. 
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Table 11.10 Partial Confounding of a 2 3 
Factorial in Blocks of Size 2 


Replicate Block Treatment Combinations 


1 


2 

3 

4 

II 5 
6 

7 

8 

III 9 
10 
11 
12 

IV 13 

14 

15 

16 


ciibiCi，aoboCi 
dibocoy aobiCo 
dibiCo, aoboco 
diboci. a^biCi 
a\biCi. aobiCo 
o^iboco, cioboCi 
ciiboci, aoboco 
G 1^1 Co ； CLobiCi 
aibiCi，aiboCo 
aobiCo, aoboCi 
aobiCi.aoboCo 
tti^iCo, aiboci 
aoboCi ， a-ibiCQ 
aiboCi.aobiCo 

aibiCi,aoboCo 
a\boCQ. aobici 


The main reason for imposing the restriction that each of the treatment combina¬ 
tions is to be tested an equal number of times is that it results in the estimates of main 
effects and interactions having maximum precision and being uncorrelated. These, of 
course, are two reasonable and desirable properties. But is maximum precision really 
always necessary? Under what conditions can we achieve a precision that we may 
consider to be “sufficient” from a practical point of view? 

The question we then ask here is whether it is always necessary to test all factorial 
combinations equally frequently or whether we can omit some of them. To get some 
insight into this question let us consider the following example. 

Example 11.4: Consider a 2 8 factorial, yielding 256 treatment combinations, but of 
the 255 d.f. only 36 account for the main effects (8) and the 2-factor interactions (28 )， 
with the remaining d.f. belonging to higher order interactions. Even if every treatment 
combination is tested only once, that is, r = 1, in blocks of size 16, say, and assuming 
that all interactions involving three or more factors are negligible, the breakdown of the 
d.f. in the ANOVA is as follows: 
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Table 11.11 ANOVA for Partially Confounded 2 3 
Factorial of Table 11.10 


Source 

d.f. 

SS 

E(MS) 

Blocks 

15 

16 9 


Treatments 

7 



A 

1 

6[^i,n,ivj 2 

4 + 6 ⑷ 2 

B 

1 

e^unjv] 2 

a e 2 + 6 问 2 

AB 

1 

4[ABii,iii] 2 

a 2 e + 4 [圳 2 

C 

1 

6[Ciijiijv] 2 

d + 6[Cf 

AC 

1 

4 [. 4 C U „] 2 

a 2 e + i[AC} 2 

BC 

1 

4[SCi.ii] 2 

〜 2 + 刺 2 

ABC 

1 

2[ABC lv } 2 

a 2 e + 2[ABC} 2 

Error 

9 

Difference 


Total 

31 

J2v 2 - % 



Source d.f. 

Blocks 15 

Main effects 8 

2-factor interactions 28 
Error 204 

Total 255 


The large number of d.f. for error (stemming from the negligible higher order inter¬ 
actions) may be more than is necessary and the precision that would result for the 
estimation of main effects and 2-factor interactions, given by a variance of a,/64, may 
well be unnecessarily high. We shall see later that under the assumptions made above 
64 carefully chosen treatment combinations may indeed provide sufficient information. 
Such a design is called ^fractional factorial or fractional replication, in this case a 1/4 
replication of a 2 s factorial. □ 
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11 J. 2 1/2 Fraction of the 2 3 Factorial 

We shall now explain the concept of a fractional factorial, that is, the choice of the 
treatment combinations to be used, in terms of a simple (but not practical) example. 
Suppose we have n = 3 factors, A, B, C, and we can use only four treatment combina¬ 
tions. How should we choose them and, having chosen them, what kind of information 
can we obtain? 

We are interested then in a 1/2 fraction of the 2 3 factorial. If we assume that the 
interaction ABC is negligible, we might choose as the 1/2 fraction either those treatment 
combinations which enter positively into ABC or those that enter negatively into ABC, 
that is, 

+ : aibici,ai6 0 c 0 ,a 0 6ic 0 .a 0 &oCi 
~ : ^o^oCo ； o^obiCi, aiboCi, aibiCo, 

Suppose we choose the fraction. Let us then examine how we would use these 
four treatment combinations to estimate main effects and 2-factor interactions. We can 
deduce this easily from the following table (which is obtained from Table 11.1): 



| A 

B 

AB 

C 

AC 

BC 

a-ibiCi 

+ 

+ 

+ 

+ 

+ 

+ 

a-iboco 

+ 

- 

- 

— 

— 

+ 

o-obiCo 


+ 

- 

- 

+ 

一 

«0^0 c l 

- 

— 

+ 

+ 

— 

— 


The use of SAS PROC FACTEX to generate this design is illustrated in Example 11.13 
(See Section 11.11). Substituting the treatment means y(aibjCk) for the treatment com¬ 
binations (a{bjCk) we see, by inspection, that A is estimated in the same way as BC，B 
in the same way as AC, and C in the same way as AB; that is, 

E{\[y{aibic{) + y{aib 0 co) - y{a 0 biCo) - y(aob 0 ci)}} = ABC (11.32) 

E{\[y{a\biCi) - y(ai6 0 co) + y(a 0 6ic 0 ) - y(a 0 6oCi)]} = B + AC (11.33) 

E{\[y{aibici) - y(aib 0 c Q ) - y(a 0 bic Q ) + y(a 0 b 0 ci)}} = C ^ AB (11.34) 

11.7.3 The Alias Structure 

Equations (11.32)-(11.34) show that there is no way of estimating individually A, B, 
AB, etc., but only linear combinations of them. This can also be deduced intuitively 
by noticing that there are only three orthogonal contrasts among the four treatment 
combinations and those correspond to (11.32) ， (11.33), and (11.34). We then say that 
A and BC, B and AC, and C and AB are confounded with each other or aliased, that 
is, we cannot estimate them separately. Formally we can obtain the so-called alias 
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structure by realizing that the mean and ABC are confounded with each other which 
we express in the form of an algebraic identity as 

I = ABC. (11.35) 

This relation is known as the defining relation (contrast) or the identity relationship. It 
determines the type of fraction we are choosing and the alias structure by interpreting 
I as unity and formally multiplying each effect into both sides of (11.35) and deleting 
any letter raised to the power 2; thus 


A = A(ABC) - A 2 BC = 

=BC 

(11.36) 

B = B(ABC) = AB 2 C = 

=AC 

(11.37) 

C = C{ABC ) 二 ABC 2 = 

=AB. 

(11.38) 


Interpreting the equality sign as “confounded with，’’ then (11.32) and (11.36), (11.33) 
and (11.37), and (11.34) and (11.38) express the same results. In this sense relation 
(11.35) also means that ABC is confounded with the mean. If we want to indicate that 
we have chosen the fraction we may write (11.35) more explicitly as 

I = -^ABC 

or if we have chosen the (complementary) “ 一 ’’ fraction, 

I = -ABC. 

In that case, rather than estimating A + BC we would be able to estimate A - BC ， 
etc. The end result remains the same: the main effects are confounded with 2-factor 
interactions. 

For this fraction, or other fractional factorials in general to be useful we must make 
additional assumptions. In our case we assume that all 2-factor interactions are negli¬ 
gible. Then all main effects become estimable. This 1/2 fraction is therefore referred 
to as a main effect plan ， also as a resolution III design (Box and Hunter, 1961), because 
the interaction (word) in (11.35) consists of three letters and as a consequence main 
effects are aliased with 2-factor interactions. 

If instead of (11.35) we had chosen the defining relation 

I = AB 

to select a 1/2 fraction, the alias structure would have been 

A = B 
C = ABC 
AC = BC. 

This seems a less desirable fraction, if only for the reason that two main effects are 
confounded. 

This simple example has brought out several properties of fractional factorials: 
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(i) Information is being “lost.” 

(ii) The fraction has to be chosen carefully to “minimize” the loss. 

(iii) Assumptions about higher order interactions have to be made in order to obtain 
(unbiased) estimates of main effects (and possibly low-order interactions). 

11.7.4 1/4 Fraction of the 2 8 Factorial 

It is clear from our discussion so far that the defining relation for a 1/2 fraction includes 
the highest-order interaction. But even a 1/2 fraction may still contain too many treat¬ 
ment combinations. Hence fractions of a high degree, such as 1/4, 1/8, etc., may have 
to be considered as viable designs. To illustrate this we now return to Example 11.4, a 
1/4 fraction of a 2 8 factorial with factors A, B, C, D, E, F, G, H. 

In principle we can proceed as follows: 

(i) divide the set of the 2 8 treatment combinations into two sets based upon the sign 
with which they enter into a chosen interaction, E\ say; 

(ii) choose one of those two sets; 


(iii) divide the chosen set again into two sets based upon the sign with which the 
treatment combinations enter into another designated interaction, E 2 , say. 

Since interactions are orthogonal contrasts we know that this will result in a set of 2 6 = 
64 treatment combinations. However, just as in constructing systems of confounding 
(Section 11.6) we must be careful in our choice of E\ and E 2 for the following reason. 
Since all 64 chosen treatment combinations have the same sign in E\ and the same sign 
in 五 2 , Ei and E 2 are confounded with the mean. It is easy to see, however, that the 
generalized interaction Es = E\E^ is then also confounded with the mean, that is, the 
64 chosen treatment combinations also have the same sign in (see Chapter 11.13). 
The question then is: How should we choose E\ and E 2 and hence to obtain a 
fraction with the “most reasonable” alias structure knowing that this will be determined 
from the defining relation 


I = El =E2 = E s . (11.39) 

An intuitive approach might be to start with the highest-order interaction for E\ and 
some other interaction for E 2 , but this may lead to a low-order interaction for Es and 
hence to an undesirable alias structure in that effects which we would like to estimate 
are confounded with each other. In order to approach this problem more systemati¬ 
cally, we first have to decide which effects we want to estimate and which interactions 
we may assume to be negligible. Suppose we want to estimate (if possible) all main 
effects and 2-factor interactions and we assume that all other interactions are negligi¬ 
ble. This means that main effects and 2-factor interactions cannot be confounded with 
other main effects and/or 2-factor interactions. To be assured of this we must have in 


(11.39) that Ei. E 2 and Es are at least 5-factor interactions. Suppose then we choose 
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E 1 = ABODE, E 2 = ABFGH, thenE s = {ABCDE)(ABFGH) = CDEFGH ， 
which obviously is of the required form. Hence we have 

I = ABODE = ABFGH = CDEFGH. (11.40) 

This defining relation indicates that main effects will be confounded with interactions 
involving four or more factors, and 2-factor interactions are confounded with interac¬ 
tions involving three or more factors, for instance, 

A = BCDE = BFGH = ACDEFGH 
AC = BDE = BCFGH = ADEFGH. 

In addition, (11.40) determines the 1/4 fraction. Suppose we decide to choose all treat¬ 
ment combinations which enter negatively into ABCDE and ABFGH ， we see from the 
definitions 

ABCDE = ^(oi — ao)(bi — 6o)( c i - c o)(di — do) 

(ei — eo)(/i + /o)(5i + 夕 o)("i + 九 0 ) 

ABFGH = ^y(ai — ao)(b\ — 6o)( c i + co)(di + do) 

(ei + eo)(/i — fo)(9i _ 9o){hi - h 0 ) 

that the selected treatment combinations must have an odd number of factors from A, 
B ， C ， D，E and A, B, F, G, H at the low (zero) level. Those treatment combinations 
are given in Table 11.12 (in order to simplify the notation we write a treatment combi¬ 
nation as (xi, X 2 , … ，抑 ) where Xi = 0, l(i = 1. 2,... ,8) indicating the low and high 
level of the ith factor, respectively). 

This fractional factorial is also called a resolution V design since the lowest-order 
interaction contained in (11.40) has five factors (letters) and consequently main effects 
are aliased with 4-factor interactions and 2-factor interactions are aliased with 3-factor 
interactions. In many situations a resolution V design is the most desirable fraction 
since it allows the estimation of all main effects and 2-factor interactions, assuming 
that all other interactions are negligible. But even such a fractional factorial may be 
too large, hence the need for resolution III and resolution IV designs. Resolution IV 
designs are fractions in which main effects are confounded with 3-factor interactions 
and 2-factor interactions are confounded with other 2-factor interactions (see Section 
II.13.3.2). 

11.7.5 Systems of Confounding for Fractional Factorials 

As mentioned earlier it may not be possible to arrange the 64 treatment combinations 
in a CRD with r = 1. Instead we may consider, an incomplete block design with 6 = 4 
blocks and A; = 16 EUs per block. To do so we make use of the method described in 
Section 11.6, 




458 


CHAPTER 11. FACTORIAL EXPERIMENTS: BASIC IDEAS 


Table 11.12 1/4 Fraction of the 2 8 Factorial 


T.C. 

# 

Xl 

X2 

X3 

X4 

Xo 

Xq 

X? 

^8 

T.C. 

# 

Xl 

X2 

X3 

Xa 

尤 5 

^6 

X7 

Xg 

1. 

0 

0 

0 

0 

0 

0 

0 

0 

33. 

1 

0 

1 

1 

1 

0 

0 

0 

2. 

1 

1 

0 

0 

0 

0 

0 

0 

34. 

0 

1 

1 

1 

1 

0 

0 

0 

3. 

0 

0 

1 

1 

0 

0 

0 

0 

35. 

1 

0 

0 

0 

1 

0 

0 

0 

4. 

1 

1 

1 

1 

0 

0 

0 

0 

36. 

0 

1 

0 

0 

1 

0 

0 

0 

5. 

0 

0 

1 

0 

1 

0 

0 

0 

37. 

1 

0 

0 

1 

0 

0 

0 

0 

6. 

1 

1 

1 

0 

1 

0 

0 

0 

38. 

0 

1 

0 

1 

0 

0 

0 

0 

7. 

0 

0 

0 

1 

1 

0 

0 

0 

39. 

1 

0 

1 

0 

0 

0 

0 

0 

8. 

1 

1 

0 

1 

1 

0 

0 

0 

40. 

0 

1 

1 

0 

0 

0 

0 

0 

9. 

0 

0 

0 

0 

0 

1 

1 

0 

41. 

1 

0 

1 

1 

1 

1 

1 

0 

10, 

1 

1 

0 

0 

0 

1 

1 

0 

42. 

0 

1 

1 

1 

1 

1 

1 

0 

11. 

0 

0 

1 

1 

0 

1 

1 

0 

43, 

1 

0 

0 

0 

1 

1 

1 

0 

12. 

1 

1 

1 

1 

0 

1 

1 

0 

44. 

0 

1 

0 

0 

1 

1 

1 

0 

13. 

0 

0 

1 

0 

1 

1 

1 

0 

45. 

1 

0 

0 

1 

0 

1 

1 

0 

14. 

1 

1 

1 

0 

1 

1 

1 

0 

46 . 

0 

1 

0 

1 

0 

1 

1 

0 

15. 

0 

0 

0 

1 

1 

1 

1 

0 

47. 

1 

0 

1 

0 

0 

1 

1 

0 

16. 

1 

1 

0 

1 

1 

1 

1 

0 

48. 

0 

1 

1 

0 

0 

1 

1 

0 

17. 

0 

0 

0 

0 

0 

1 

0 

1 

49. 

1 

0 

1 

1 

1 

1 

0 

1 

18. 

1 

1 

0 

0 

0 

1 

0 

1 

50. 

0 

1 

1 

1 

1 

1 

0 

1 

19. 

0 

0 

1 

1 

0 

1 

0 

1 

51. 

1 

0 

0 

0 

1 

1 

0 

1 

20. 

1 

1 

1 

1 

0 

1 

0 

1 

52. 

0 

1 

0 

0 

1 

1 

0 

1 

21. 

0 

0 

1 

0 

1 

1 

0 

1 

53. 

1 

0 

0 

1 

0 

1 

0 

1 

22 , 

1 

1 

1 

0 

1 

1 

0 

1 

54. 

0 

1 

0 

1 

0 

1 

0 

1 

23 . 

0 

0 

0 

1 

1 

1 

0 

1 

55 . 

1 

0 

1 

0 

0 

1 

0 

1 

24 . 

1 

1 

0 

1 

1 

1 

0 

1 

56. 

0 

1 

1 

0 

0 

1 

0 

1 

25 . 

0 

0 

0 

0 

0 

0 

1 

1 

57. 

1 

0 

1 

1 

1 

0 

1 

1 

26. 

1 

1 

0 

0 

0 

0 

1 

1 

58. 

0 

1 

1 

1 

1 

0 

1 

1 

27 , 

0 

0 

1 

1 

0 

0 

1 

1 

59. 

1 

0 

0 

0 

1 

0 

1 

1 

28. 

1 

1 

1 

1 

0 

0 

1 

1 

60. 

0 

1 

0 

0 

1 

0 

1 

1 

29. 

0 

0 

1 

0 

1 

0 

1 

1 

61. 

1 

0 

0 

1 

0 

0 

1 

1 

30. 

1 

1 

1 

0 

1 

0 

1 

1 

62. 

0 

1 

0 

1 

0 

0 

1 

1 

31. 

0 

0 

0 

1 

1 

0 

1 

1 

63. 

1 

0 

1 

0 

0 

0 

1 

1 

32. 

1 

1 

0 

1 

1 

0 

1 

1 

64. 

0 

1 

1 

0 

0 

0 

1 

1 
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Example 11.5: For a ~ fraction of a 2 8 factorial, that is, a 2 8-2 fractional 
factorial we obviously do not want to confound main effects and 2-factor interactions. 
These account for only 36 estimable functions of effects out of the 63 available. We 
therefore need to find two estimable functions of effects which were assumed to be 
negligible earlier. We then confound these effects and their generalized interaction 
with blocks. It follows from (11.40) that, for example, 

ACF = BDEF = BCGH = ADEGH (11.41) 


and 


BDG = ACEG = ADFH = BCEFH 


are such functions and their generalized interaction 


ABCDFG = EFG 二 CDH = ABEH 


(11.42) 

(11.43) 


is also of such form. The four blocks are then obtained by considering the signs with 
which each of the 64 treatment combinations obtained from (11.40) and given in Table 
11.12 enters into ACF (11.41) and BDG (11.42) as follows: 



In Table 11.13 we give those signs for each treatment combination and in Ta¬ 
ble 11.14 we give the final design. 

The main effects and 2-factor interactions can then be estimated in the usual way. 
For an effect X we have 

X = Q \ , [sum of 2 5 obs. — sum of remaining 2 5 obs.l 

Oo 一 2 , 一 1 1 J 

with 1 ] 

var(X) = 

We also have for the ANOVA (as outlined earlier) 

SS(X) = 2 4 [.Y] 2 . 

The sums of squares associated with higher order interactions, except those given in 
(11.41), (11.42), and (11.43), are, of course, part of the SS(Error). □ 


The methods of obtaining the 1/4 fraction and the block arrangements may ap¬ 
pear rather tedious, but they illustrate the underlying principles. More expeditious 
methods are described in Chapters 11.13 and 14 together with methods of obtaining 
fractional factorials for other factorial experiments. See also Section 11.11 illustrating 
SAS PROC FACTEX (SAS Institute, Inc. 2002 - 2003). 
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Table 11.13 Signs with Which Treatment Combinations 
in Table 11.12 Enter into ACF and BDG 


T.C.# 

ACF 

BDG 

T.C.# 

ACF 

BDG 

1. 

— 

—— 

33. 

— 

+ 

2. 

+ 

+ 

34. 

+ 

- 

3. 

+ 

+ 

35. 

+ 

— 

4. 

- 

— 

36. 

- 

+ 

5. 

+ 

— 

37. 

+ 

+ 

6. 

— 

+ 

38. 

— 

— 

7. 

- 

+ 

39. 

- 

— 

8. 

+ 


40. 

+ 

+ 

9. 

+ 

+ 

41. 

+ 

— 

10. 

— 

— 

42. 

— 

+ 

11. 

— 

— 

43. 

— 

+ 

12. 

+ 

+ 

44. 

+ 

- 

13. 

- 

+ 

45. 

— 

— 

14. 

+ 

— 

46. 

+ 

+ 

15. 

+ 

一 

47. 

+ 

+ 

16. 

-- 

+ 

48. 

— 

- 

17. 

+ 

- 

49. 

+ 

+ 

18. 

— 

+ 

50. 

— 

— 

19. 

— 

+ 

51. 

— 

— 

20. 

+ 

— 

52. 

+ 

+ 

21. 

— 

— 

53. 

— 

+ 

22. 

+ 

+ 

54. 

+ 

一 

23. 

+ 

+ 

55. 

+ 

— 

24. 

一 

— 

56. 

— 

+ 

25. 

- 

+ 

57. 

— 

— 

26. 

+ 

- 

58. 

+ 

+ 

27. 

+ 

— 

59. 

+ 

+ 

28. 

一 

+ 

60. 

- 

— 

29. 

+ 

+ 

61. 

+ 

— 

30. 

一 

- 

62. 

— 

+ 

31. 

— 

— 

63. 

一 

4 - 

32. 

+ 

+ 

64. 

+ 
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Table 11.14 1/4 Fraction of 2 s Factorial in Blocks of Size 16 






Block 1 






Block 2 




1 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

0 

0 

0 

0 

0 

1 

1 

0 

0 

0 

0 

1 

1 

0 

1 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0 

1 

1 

1 

0 

1 

1 

1 

0 

1 

1 

1 

1 

0 

1 

1 

0 

0 

0 

0 

1 

1 

1 

1 

0 

1 

1 

1 

0 

1 

1 

0 

1 

0 

0 

0 

0 

0 

1 

0 

1 

0 

0 

0 

1 

1 

1 

0 

1 

1 

1 

1 

1 

0 

1 

0 

1 

0 

0 

1 

0 

1 

0 

1 

1 

1 

1 

0 

0 

0 

0 

1 

1 

1 

1 

0 

1 

1 

0 

1 

1 

0 

0 

1 

1 

0 

0 

1 

1 

1 

0 

0 

1 

0 

0 

0 

0 

0 

1 

1 

1 

1 

0 

0 

0 

0 

1 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

1 

0 

1 

0 

1 

1 

0 

1 

0 

1 

1 

1 

1 

1 

0 

1 

0 

1 

0 

0 

1 

1 

0 

0 

1 

0 

0 

1 

1 

1 

0 

1 

0 

1 

1 

1 

1 

0 

1 

0 

1 

0 

1 

0 

1 

0 

1 

0 

1 

0 

0 

1 

1 

0 

1 

1 

0 

1 

0 

0 

1 

0 

1 

0 

1 

1 

1 

1 

0 

1 

1 

1 

0 

0 

1 

0 

0 

1 

1 

1 

0 

0 

0 

1 

0 

1 

1 

0 

1 

1 

0 

0 

0 

1 

1 





Block 3 






Block 4 




1 

1 

1 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0 

0 

0 

1 

1 

1 

1 

0 

0 

0 

0 

0 

0 

1 

0 

1 

1 

1 

0 

1 

1 

0 

0 

0 

1 

1 

0 

1 

1 

0 

1 

1 

1 

1 

0 

0 

0 

1 

1 

0 

1 

1 

0 

1 

1 

0 

0 

0 

1 

0 

1 

0 

0 

1 

0 

1 

1 

0 

1 

0 

0 

1 

1 

0 

1 

0 

1 

1 

1 

0 

1 

1 

1 

0 

1 

0 

0 

0 

0 

0 

0 

1 

1 

1 

1 

1 

0 

1 

0 

1 

1 

1 

1 

1 

1 

0 

0 

1 

1 

0 

0 

0 

1 

1 

0 

1 

1 

1 

0 

1 

1 

1 

0 

0 

0 

0 

1 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

1 

0 

0 

0 

1 

0 

1 

0 

0 

0 

0 

0 

0 

1 

1 

1 

1 

1 

1 

0 

1 

0 

0 

1 

0 

1 

1 

0 

1 

0 

0 

0 

1 

1 

1 

0 

0 

1 

1 

0 

0 

1 

1 

0 

1 

0 

0 

1 

0 

1 

0 

1 

0 

1 

1 

1 

1 

1 

0 

1 

0 

1 

1 

0 

0 

1 

0 

1 

1 

0 

0 

0 

1 

1 

0 

1 

0 

1 

0 

1 

0 

0 

1 

1 

1 

0 

1 

1 

1 

0 

1 

1 

1 

0 

1 

0 

0 

0 

1 

1 

0 

1 

0 

0 

1 

0 

1 

1 
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11.8 ORTHOGONAL MAIN EFFECT PLANS FOR 
2 n FACTORIALS 

Among the fractional factorials discussed in the previous section, resolution III designs 
or main effect plans are of particular importance. We shall give a brief discussion here 
but defer details to Chapter 11.14. 

As mentioned earlier, the value of factorial experiments in general lies in the fact 
that higher order interactions are usually negligible. This leads to a considerable re¬ 
duction in the number of parameters, that is, treatment effects or, more specifically, 
treatment effect contrasts, that need to be considered in the analysis of the data from 
such experiments. This, in turn, also leads to a reduction in the number of treatment 
combinations to be used in an experiment and hence to a reduction in the number of 
observations to be taken. It is in this sense that factorial experiments can be very eco¬ 
nomical. 

The extreme situation is obviously achieved if all interactions can be considered 
negligible, so that the treatment effects can be represented in terms of main effects 
only. 

Example 11.6: For n = 3, we can rewrite model (11.2) as 

丁 ijk = /x H- An + A2j + Ask- (11.44) 

For the 2 3 situation, which is of concern here, (11.44) can be rewritten in terms of the 
main effects as defined in (11.5) and Table 11.1 as follows (replacing Ai by A, A 2 by 
B, and A 3 by C): 

r ijk = ^±\A±\B±\C (11.45) 

where i.j, fc = 0,1 and the signs on the right-hand side of (11.45) depend on the values 
of i, j, and k in that the minus sign is used if the corresponding subscript is 0 and the 
plus sign is used if the corresponding subscript is 1. For example, 

Ton = ciobiCi = fj, — + + \C 

The equivalence of (11.44) and (11.45) can be verified easily by using the definition 
of the main effects for the 2 n factorial and taking " = E aibjCk/ 2 3 (for details see 
Chapter II.7). The model (11.45) can obviously be extended to the general 2 n factorial. 

□ 

In general then, if the assumption of no interactions is reasonable, we need to be 
able to estimate only 1 + n parameters (the mean and n main effects) for an experiment 
with n factors each at two levels. We have seen in Section 11.6.2 that, for example, 
for n — 3 this can be achieved with 4 treatment combinations, that is, a |-fraction of 
the 2 3 experiment. The treatment combinations (runs) used are listed below where the 
levels of the factors A, B,C for runs 1 ， 2, 3, 4 are given in the body of the table: 
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Factor 


Run # 

A 

B 

c 

Observation 

1 

1 

1 

1 

Vi 

2 

1 

0 

0 

V2 

3 

0 

1 

0 

ys 

4 

0 

0 

1 

V4 


Inspection of the table shows that for each pair of factors, that is, (A, B) ， (A, C), 
(B ， C), each possible ordered combination of zeros and ones occurs the same number 
of times, in this case once. Such arrangements are called orthogonal arrays (more 
precisely, orthogonal arrays of strength two) and in connection with fractional factorials 
the design is called an orthogonal main effect plan. As the name suggests these plans 
allow the estimation of main effects (under the assumption of no interactions), and the 
estimators for the main effects are uncorrelated. For example, with the observations 
for the design above denoted by yi, 2 / 2 , 2 / 3 , 2 / 4 , we have 




z \{yi + V2 — V2, — IJ 4 ) 

and 

B = 

z — V2 + U3 — ua) 

with 

cov(^4, B) 

= i(^e-^e-^e+^e) = 

and so on. 




The main effect plans described above have been given considerable prominence 
in industrial and process development, associated with the name Taguchi (for example, 
Taguchi ， 1986). An interesting example of applying a main effect plan in product 
development is a tile experiment described by Taguchi (1986, pp. 80-83): 

Example 11.7: Seven factors, all related to the apportionment of materials in tile 
production and each having two levels (level 0 = level used in current production, 
level 1 = level thought to be superior in terms of cost and quality), are to be investi¬ 
gated in an effort to find the “best” combination of levels. More precisely, the factors 
and their levels were the following: 


Factor 

Level 

.... 

0 

1 

A: Lime additive content 

1% 

5% 

B: Granularity of additive 

coarse 

fine 

C: Agalmatolite content 

53% 

43% 

D: Type of agalmatolite 

current mixture 

less expensive mixture 

E: Charge quantity 

1,300 kg 

1,200 kg 

F: Waste return content 

4% 

0% 

G: Feldspar content 

5% 

0% 
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The design used was an orthogonal main effect plan for 7 factors using the following 8 
treatment combinations: 


Run # 

Factor 

A 

B 

c 

D 

E 

F 

G 

1 

0 

0 

0 

0 

0 

0 

0 

2 

0 

0 

0 

1 

1 

1 

1 

3 

0 

1 

1 

0 

0 

1 

1 

4 

0 

1 

1 

1 

1 

0 

0 

5 

1 

0 

1 

0 

1 

0 

1 

6 

1 

0 

1 

1 

0 

1 

0 

7 

1 

1 

0 

0 

1 

1 

0 

8 

1 

1 

0 

1 

0 

0 

1 


The data consisted of the percent defective tiles. After estimating the main effects 
and using a model of the form (11.45) the optimum set of conditions was found to be 
aibiCodoeifigo, (Taguchi uses a slightly different argument and the reader is referred 
to his account (Taguchi, 1986 p. 83).) □ 

The same design as given above can also be used for fewer than 7 factors, say 5, 
by simply omitting factors F and G, say. This allows us to estimate the experimental 
error by using the contrasts that would otherwise have been the estimates of the main 
effects F and G. 

This last remark points out a potential difficulty with what are called saturated 
main effect plans ，for example, a main effect plan for a 2 3 factorial in 4 runs or for a 
2 7 factorial in 8 runs, do not allow estimation of error. Such information must then be 
obtained from external sources or the experiment must be enlarged by replication of at 
least some treatment combinations. 

In the same way as described above we can examine 11 or fewer factors with 12 
runs in a plan originated by Plackett and Burman (1946) or 15 or fewer factors with 
16 runs using an orthogonal array, and so on. Construction of such designs will be 
discussed in Chapter 11.14. 


11.9 EXPERIMENTS WITH FACTORS AT 
THREE LEVELS 

We have mentioned earlier the usefulness of 2 n factorial experiments, especially for 
exploratory studies. We have, however, pointed out also that 2 n factorials allow us to 
study relatively simple effect structures only. To be more specific, in studying quan¬ 
titative factors we can make inferences only about linear (main) effects and linear x 
linear-type interactions. This can easily be understood by writing the linear model as a 



11.9. EXPERIMENTS WITH FACTORS AT THREE LEVELS 


465 


regression model. For example, for a 2 3 factorial in a CRD we write 

y{xiX 2 X 3 )i = A) + (3^1 -h 02X 2 + 53^3 

+ /?12^1^2 + 013^1^3 + 023X 2 Xs + 0123^1^3 

+ e(x 1 x 2 x 3 )i (, = 1,2, … ， r), (11.46) 

where X\, X2, X3 represent the (coded) levels of factors A, B, C, respectively, with 
Xi = -1 ， l(i = 1 ， 2, 3). In this model the regression coefficients, /? 2 , and /? 3 , are 
then (apart from a constant) the main effects of A, B, C, respectively, ,3!2, P13, and 
P 23 are the two-factor interactions between A and B, A and C, and B and C ， respec¬ 
tively, and /?i 23 is the 3-factor interaction between A, B, C. These seven regression 
coefficients account for the seven d.f. among the eight treatment combinations. There 
is, therefore, no opportunity to explore possible curvature in the main effects. For this 
we need at least three levels for each factor so that we have at least two d.f. for each 
main effect. 

11.9.1 The 3 2 Factorial 

Suppose then we have n factors A, B, C , … each at three levels. This is referred to 
as a 3 n factorial which can be used in conjunction with any error control design. Let 
us consider specifically the case n = 2 for purposes of illustration. Model (11.1) then 
reduces to 

Tij = T (T ； j. — T..) + ( 丁 .j — T..) + 、丁 ij — 丁 +i. 一 丁 .j + T..) 

or 

Tij = fi' Ai Bj + (AB)ij (11.47) 

with i,j = 0.1,2 representing the levels of the factors A and B. The main effects 
for A and B account for two d.f. each and the interaction between factors A and B 
accounts for four d.f. making up the eight d.f. among the nine treatment combinations. 
These d.f. can be partitioned further (see Section 7.2) depending on the nature (that is, 
qualitative or quantitative) of the factors. We shall consider here the case where both 
factors are quantitative. 

Let Xu and X 21 (I = 0,1,2) denote the equally spaced levels of factors A and B, 
respectively. Then 

— Xu — Xj, 

Zl — 

are the coded levels, with xio = -1, xn = 0, Xi 2 = -f l(i = 1,2). Similar to (11.46) 
we can then write a model of the form 

y(xuX2l>)m — /?0 + 0\X\l + 02^21' + Pll x ll + 022^2l , + Pl2^ll^2V 

+ 0122^1l^2U + 8 ll 2 xliX 2 U 9ll22x\ixl v + e{xuX 2 u)m (11.48) 
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(LI = 0,1. 2; m = 1,2,... ,r). This is an explicit model accounting for all d.f. for 
main effects and two-factor interactions. Using the method of least squares, estimates 
of the regression coefficients can be obtained. Tests of hypotheses can be performed 
concerning these regression coefficients (see Chapter 4). Although this is straightfor¬ 
ward, the interpretation is not always easy since the estimators are correlated. A more 
convenient way sometimes is to use a representation in terms of orthogonal polynomi¬ 
als (see Section 7.2). 

Let Po{x), Pi(x), and P 2 (x) be the zero-th, first, and second order polynomials, 
respectively, for i = 3 (see Table 7.3). We can then rewrite (11.48) as 

2 

y{xiuX 2 v)m = ^2 + e ( x U,X2l')m 

i,i'=0 

=ckoo + a w Pi(xuj + a 0 \Pi(x2v) + a 2 o-P2(a ： ii) 

+ a 0 2P2(X2l') + OiiiPi{xu)Pi{X2v) 

+ Oii2Pl{xu)P 2 {X2l') + a 2 lP2(xu)Pi(x2v) 

+ a 22 P 2 {xil)P 2 {x 2 l') + e{xuX 2 l')m- (11.49) 

To make this representation more explicit it is useful to write (11.49) in matrix nota¬ 
tion as 

y = Xa + e ? (11.50) 

where y is the column vector of observations, X is the design-model matrix of known 
constants, a is the column vector of regression coefficients, and e is the column vector 
of errors. To simplify the notation we consider the case r = 1. The matrix X is then as 
given in Table 11.15 (for r > 1， each row of X is repeated r times). Since the columns 
of X are orthogonal to each other, X’X is a diagonal matrix, and it is then easy to obtain 


Table 11.15 Design-Model Matrix X for 3 2 Factorial 



d = (X'X) -1 ^ 
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Table 11.16 ANOVA for Model (11.49) 


Source 

d.f. 

SS 

五 (MS) 

Treatments 

8 

SS(T) 


Al 

1 

6r(di 0 ) 2 

+ 6r(aio) 2 

B l 

1 

6r(d 0 i) 2 

4 - 6r(a 0 i) 2 

a q 

1 

18r(d 2 o) 2 

erf + 18r(a 20 ) 2 

Bq 

1 

18 r(d 0 2) 2 

+ 18r(a 0 2) 2 

Al x Bl 

1 

4r(Qn ) 2 

+ 4r(an) 2 

Al x Bq 

1 

12r(an) 2 

<y 2 e + 12r(ai2) 2 

Aq x Bl 

1 

12r(d 2 i) 2 

-f 12r(a2i) 2 

Aq X Bq 

1 

36r(d 2 2) 2 

+ 36 咖 22) 2 

Error 

9(r - 1) 

SS(E) 


Total 

9r- 1 

SS(Total) 



For example ， 

釦 0 = ， (i ， .)-n-i，.)] 

where Y(l. ■) = Sf, = 0 y(l ， a ： 2 r)，and so on. Obviously, dio is the estimator for the 
linear effect of factor A, Al say. Similarly, 

知 o = ^[r(-l:.) —2Y(0，.）+ Y(1..)] 

is the estimator for the quadratic effect (that is, curvature) of factor A y Aq say. To 
mention one of the interaction parameters, consider 

«u = l{[y(Wi-) - y(i, ： -i)] - [y(—l, l) + 2/(-1,-l)]}, 

that is, a comparison of the linear effect of B at = 1 versus the linear effect of B at 
x\ = -1. We denote this interaction by Al x Bl ， Other interaction effects are defined 
and estimated similarly. 

Test of hypotheses about the regression coefficients in (11.49) can be made in con¬ 
junction with the ANOVA by partitioning SS(T) into eight single d.f. sums of squares 
each accounting for one of the regression coefficients. Details are given in Table 11.16. 
We note that the sums of squares are given for a CRD with r > 1 replications, with the 
appropriate changes in the estimated regression coefficients, for example, 


d 10 = -[r(i,.).-n-i-)-] 
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m). = y(l ， x 2l')m 

l/=Q m=l 

is the sum of all the observations with x\ — 1, etc. The method of computing the sums 
of squares is, of course, that given in (7.24). 

11.9.2 Extensions 

Extensions of this method to the general case of n factors each at three levels should 
now be obvious. As more factors are included in the experiment many more d.f. 
become available for interactions, not only for interactions between two factors, but 
three factors, four factors, and so on. For higher order interactions partitions into 
Al x Bl x Cl->Al x Bl x Cq, and so forth may not be particularly useful as these 
components will become difficult to interpret except that they are part of the A x 5 x C 
interaction. Moreover, just as with 2 n factorials, it is entirely likely that interactions 
involving three or more factors are negligible and that their sums of squares may be 
pooled with SS(E). 


11.9.3 Formal Definition of Main Effects and 
Interactions 

Our discussion of 3 n factorials so far has concentrated on quantitative factors with 
equally spaced levels. A more general representation dealing with other situations and 
in particular qualitative factors is obviously needed. We shall discuss such a method 
here briefly, deferring a more in-depth discussion to Chapter II. 10. 

We have seen in Section 11.3 that for the 2 n factorial each main effect and each 
interaction can be expressed as a contrast among all 2 n treatment combinations or, 
more precisely, among the true responses of all 2 n treatment combinations. Thus each 
main effect and interaction is represented by a single d.f. contrast. Moreover, these 
contrasts are mutually orthogonal. 

For the 3 n factorial we have seen above that each main effect, A say, consists of 
two comparisons, Al and Aq. And each 2-factor interaction, A x B say, consists 
of four contrasts, Al x Bi.Al x Bq. Aq x Bl and Aq x Bq. Extending this, 
each 3-factor interactions consist of 8 comparisons, and so on. Expressed alternatively, 
each main effect is represented by or accounts for 2 d.f., each 2-factor interaction for 
4 d.f., each 3-factor interaction for 8 d.f., and so on. To partition the 4 d.f. for 2- 
factor interactions into two orthogonal sets of 2 d.f. each and the 8 d.f. for 3-factor 
interactions into 4 mutually orthogonal sets of 2 d.f. each, Yates (1937) introduced 
what we might call interaction components. This notion and the mathematics of the 
method were formalized by Kempthorne (1952) and can be described as follows. 


Example 11.8: Let us consider the 3 3 factorial. We write a treatment combination 
as x — {x\.X 2 ， X 2 ,) where Xi denotes the level of the ith factor with = 0,1 ； 2 (z = 
1 , 2. 3). To simplify the notation we denote the three factors by A, B, and C. Further, 
let r(x) represent the true effect of the treatment combination x. 
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We know from Section 11.9.1 that the 2 d.f. corresponding to main effect A, say, 
are represented by comparisons among the treatment means with x\ = 0 , = 1 , and 
X\ = 2, that is, f(0. •, •) vs. f(1, •, •) vs. f(2, •, •). Here f (a ： i, •, •) for x\ = 0,1, 2 
is a mean of 9 effects averaging over the 9 treatment combinations (x\^X 2 , xs) with 
x 2 ,a ：3 = 0.1, 2 and fixed X\. Similarly, the main effects for B and C are represented 
by comparisons among treatment means with = 0 . X 2 = 1 , 尤 2 = 2 , and 0:3 = 
O.xs = l,xs = 2 , respectively. 

This idea of two comparisons among three treatment means or, alternatively, among 
three sets of treatment combinations can be carried over to the various interactions. As 
mentioned above the 2-factor interaction A x B is partitioned into two components. 
These components are denoted by AB and AB 2 and are defined as follows: AB is rep¬ 
resented by comparisons among the three means of treatments satisfying the equations 


$1 + 工 2 = 0 VS. X\ X2 = I vs. X\ J rX2= 2, 

where X\.X 2 = 0,1, 2 and all arithmetic is modulo 3. The second component, AB 2 , is 
represented by comparisons among means of treatments satisfying the equations 


X\ -h 2X2 = 0 V S. Xi + 2X2 = 1 VS. Xi + 2x2 = 2 

all mod 3. The remaining 2-factor interaction components, AC and AC 2 for AxC, and 
BC and BC 2 for B x C, are defined similarly. Finally, the four interaction components 
of the 3-factor interaction A x B x C are denoted by ABC, AB 2 C, ABC 2 . AB 2 C 2 , 
and are represented by comparisons among sets of treatment combinations satisfying 
the following equations 


ABC : xi X 2 xs = 0,1,2, mod 3 

AB 2 C : x\ + 2x2 + ^3 = 0.1.2. mod 3 

ABC 2 : 2 ：i + X 2 + 2x 3 =0,1,2, mod 3 

AB 2 C 2 : xi + 2x2 + = 0,1,2. mod 3. 

We summarize the above definition of the various main effects and interactions for the 
3 3 factorial in Table 11.17, giving the names of the effects/interactions, and interaction 
components together with their d.f. and the left-hand sides of the equations defining 
the partitions of the 3 3 treatment combinations (the right-hand sides are always 0, 1， 2 
mod 3). The reader should have no difficulties extending the procedure to 3 n factorials 
with n > 3. 

To motivate these definitions of main effects and interaction components we re¬ 
ferred to contrasts defining linear and quadratic contrasts as parts of main effects, for 
example, Al and Aq as parts of the main effect for factor A. In the context of the 
above discussion Al is represented by the comparison of treatment combinations sat¬ 
isfying xi = 0 vs. X\ — 2. Similarly, Aq is represented by the comparison of the form 
\{x\ =0 and X\ = 2} vs. Xi = 1. We know, of course, that the two contrasts are 
orthogonal. For the formal definition as given above this does not have to be the case. 
For example, the two comparisons for main effect A could be {xi = 0 vs. x\ = 1} and 
{xi = 0 vs. X\ = 2}. Another point we need to make here is that the comparisons for ， 
say, AB and AB 2 bear no relationship to Al x Bl,Al x Bq. Aq 乂 Bl ，and Aq x Bq. 
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Table 11.17 Main Effects and Interactions for the 3 3 Factorial 


Main Effect/Interaction 

d.f. 

Equation 

A 

2 


B 

2 

尤 2 

Ax B 

4 


AB 

2 

X\ 4 - x 2 

AB 2 

2 

X\ + 2X2 

c 

2 

工 3 

AxC 

4 


AC 

2 

Xi + Xs 

AC 2 

2 

Xi + 2a：3 

BxC 

4 


BC 

2 

X 2 + 尤 3 

BC 2 

2 

x 2 + 2^3 

Ax BxC 

8 


ABC 

2 

X 3 

ab 2 c 

2 

Xi + 2X2 + ^3 

ABC 2 

2 

xi J tx 2 J r 2 x 3 

AB 2 C 2 

2 

Xi + 2X2 + 2a：3 


These two representations simply refer to different partitions of the 4 d.f. for Ax B. 

. □ 

The formal definitions of main effects and interaction components as presented in 
this section are most valuable in considering suitable arrangements for 3 n factorials in 
incomplete blocks or in choosing suitable fractions of 3 n factorials. We shall illustrate 
this with a few simple examples. 


11.9.4 Systems of Confounding for the 3 n Factorial 

Let us consider the 3 3 factorial and blocks of size 9(= 3 2 ). To accommodate all 27 
treatment combinations in blocks of size 9 we need 3 blocks. The idea then is, just as in 
the case of 2 n factorial (see Section 11.6), to confound certain interactions with blocks. 
In our case we have 2 d.f. among 3 blocks, which means that we need to confound an 
interaction with 2 d.f. with blocks. More precisely, we shall choose an interaction 
component which does account for 2 d.f. as we have just explained. If, for example, 
the 3-factor interaction AxBxCis assumed to be negligible or unimportant we can 
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Table 11.18 Partition of 3 3 Treatment Combinations 


Set 1: 

^1 + ^2 + ^3 = 0 

Set 2: 

^1 + ^2 + ^3 = 1 

Set 3: 

xi x 2 xs = 2 

000 

100 

200 

111 

2 11 

011 

222 

022 

1 22 

120 

220 

020 

2 10 

0 10 

110 

102 

202 

002 

20 1 

00 1 

10 1 

0 12 

112 

2 12 

02 1 

12 1 

22 1 


choose any one of the four interaction components ABC, AB 2 C, ABC 2 , or AB 2 C 2 
to confound with blocks. They are all equally important or, in our case, unimportant. 
Suppose we choose ABC. We then partition the 3 3 treatment combinations into three 
sets according to the equations for ABC, namely: 

Set 1: xi-\-X 2 J rXz = 0 mod 3 
Set 2: x\ X 2 + xz = \ mod 3 
Set 3: x\-\- X2~\~ X2, = 2 mod 3. 


Each set contains, of course, exactly 9 treatment combinations. These sets are given 
in Table 11,18. The treatment combinations in a given set are then all assigned to the 
same block thus generating what we shall refer to as the basic arrangement in 3 blocks 
of size 9 each. For the actual experiment the basic arrangement may then be replicated 
r times giving us 3r blocks. 

Using the formal definitions of Section 11.9.3 we can verify easily that only ABC 
is confounded with blocks and that all other main effects and interactions can be esti¬ 
mated from this arrangement. For example, in each set (block) there are exactly three 
treatment combinations with X\ = 0, x\ = 1, and Xi = 2 which implies that any 
comparison among these three sets of treatment combinations is free of block effects. 

A consequence of this system of confounding is that although full information is 
obtained on all main effects and 2-factor interactions, only limited information on the 
3-factor interaction Ax B xC is available through the interaction components AB 2 C, 
ABC 2 , and AB 2 C 2 . This means that in the ANOVA table the 3-factor interaction 
A x B x C has only 6 d.f. 

This may be sufficient to get some idea whether 3-factor interaction is present. If, 
on the other hand, one is not willing to make the assumption that there is no interaction 
(as we did above) an obvious solution would be to use partial confounding (see Sec¬ 
tion 11.6.3). This can be done in various ways. We shall mention only two to illustrate 
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the general idea: 

(i) Use two basic replicates by confounding two of the 3-factor interaction compo¬ 
nents, say ABC and AB 2 C, one in each of the basic replicates. This allows 1/2 
information on ABC and AB 2 C and full information on ABC 2 and AB 2 C 2 , 
thus restoring the 8 d.f. for Ax B x C. 

(ii) If sufficient experimental material is available we could use four basic replicates 
confounding ABC, AB 2 C, ABC 2 , and AB 2 C 2 in one of the four basic repli¬ 
cates, respectively. This will yield then 3/4 information on all four components 
and hence on the 8 d.f. for: Ax B x C. 

The general ideas presented in Section 11.6.2 to construct systems of confounding for 
2 n factorials in blocks of size 2 l (l < n) can be extended to 3 n factorials in blocks of 
size 3^. For example, we may consider a 3 3 factorial in blocks of size 3. Obviously， 
several main effects or interaction components will have to be confounded with blocks; 
in this case four to be exact since we have 9 blocks and hence 8 d.f. among the blocks. 
These have to be chosen carefully. We shall not pursue this here any further but defer 
description of the general procedure to Chapter II. 10. 


11.9.5 Fractions of 3 n Factorials 


Even more so than for the 2 n factorial the number of treatment combinations for the 
3 n factorial may be far too large for practical applications. And again, if higher order 
interactions are considered to be negligible it may be entirely satisfactory to consider 
only a fraction of all possible treatment combinations and still obtain most if not all 
of the information needed. The easiest method is to consider 1/3 1 fractions of the 
3 n factorial {I < n). We shall give a simple example here and defer a more general 
description to Chapters 11.13 and 14. 

EXAMPLE 11.9: Let us consider a 1/3 fraction of the 3 3 factorial, that is, 9 out of the 
possible 27 treatment combinations. As in Section 11.7 the general idea is to assume 
that the highest-order interaction, in this case the 3-factor interaction, is negligible and 
use that fact to choose the treatment combinations to be included in the fraction. Recall 
that for each interaction component the set of all treatment combinations is partitioned 
into three subsets. For example for the component ABC the subsets will be obtained by 
satisfying the equations 


工 l + 尤 2 + 尤 3 = 0, 1， 2 mod 3. 


Any one of these subsets constitutes a 1/3 fraction which obviously does not allow 
estimation of contrasts belonging to ABC. What are other consequences? 

Since we are considering only 9 treatment combinations we have 8 d.f. for treat¬ 
ments, that is, there are 8 linearly independent comparisons among treatment effects 
that can be estimated. How can we identify these? Let us consider set 1 in Table 11.18 
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which constitutes the 1/3 fraction under consideration here: 

0 0 0 1 2 0 2 0 1 

111 210 012 
2 2 2 1 0 2 0 2 1 

The 2 d.f. associated with the main effect for factor A result from the comparisons of 

treatment combinations satisfying 

{x\ = 0} vs. {x\ = 1} vs. {x\ = 2}, (11.51) 

that is, 



A close look at the three sets of treatment combinations given in (11.52) shows that in 
addition to satisfying the equations in (11.51) they also represent comparisons satisfy¬ 
ing the equations 

{x 2 + o ；3 = 0} vs. {x 2 + x-s = 2} vs. {x 2 + x 3 = 1} (11.53) 

and 

{xi + 2 x 2 + 2x 3 = 0} vs. {xi + 2x 2 + 2 a ：3 = 2} 

vs. {^i + 2 x2 + 2a：3 = 1}. (11.54) 

This means that the contrasts belonging to A also belong to the interaction compo¬ 
nents BC and AB 2 C 2 as indicated by (11.53) and (11.54), respectively (see also Ta¬ 
ble 11.17). In the terminology of Section 11.7 we thus say that A, BC and AB 2 C 2 are 
confounded or aliased with each other. 

Using similar arguments we can also show that B, AC, and AB 2 C are confounded 
with each other, and so are C, AB, ABC 2 and, finally, AB 2 , AC 2 , BC 2 . To sum¬ 
marize, the alias structure for this fraction can be written as (using the convention of 
Section 11.7): 


A 

=BC = 

ab 2 c 2 


B 

=AC = 

--AB 2 C 

(11.55) 

C 

=AB = 

--ABC 2 


AB 2 

=AC 2 : 

= BC 2 . 



We have thus identified four sets of comparisons, each accounting for 2 d.f. In ad¬ 
dition these four sets are linearly independent of each other which means that we have 
indeed identified the 8 comparisons accounting for the 8 d.f. among the 9 treatment 
combinations. 
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Formally, (11.55) can be derived by a mathematical argument similar to that de¬ 
scribed in Section 11.7. We start with the identity relationship which determines the 
fraction, that is, the treatment combinations to be chosen as well as the alias structure. 
For our example the identity relationship is 

1 = ABC (11.56) 

or if we want to determine the fraction uniquely 

I = ABCq, 

which means that we choose the treatment combination satisfying Xi + X 2 xz = 0 
rather than the other two possibilities, that is, + ^2 + ^3 = 1 or + ^2 + ^3 — 2. 
To obtain the alias structure we proceed as follows: 

(i) We consider (11.56) as a mathematical equation with I being the identity. 

(ii) We multiply each effect, that is, main effect or interaction component, formally 
into both sides of (11.56); 

(iii) The power for each letter is reduced mod 3 if necessary and any letter raised to 
power 0 is deleted from the expression. 

(iv) If the first letter with nonzero power is raised to the power 2 then the entire 

expression is squared and again reduced mod 3 (this is done to adhere to the 
convention for having a unique enumeration of all possible effects such as given 
in Table 11.17). ^ 

(v) In addition each effect is multiplied in the same way into (ABC) 2 to obtain the 
second alias. 

To illustrate these steps we shall find the aliases of A, namely, 

A = A(ABC) = A(ABC) 2 

= a 2 bc = a 3 b 2 c 2 

=(A 2 BCf = B 2 C 2 
=A i B 2 C 2 = (B 2 C 2 ) 2 

=ab 2 c 2 = b 4 c 4 

=AB 2 C 2 = BC. (11.57) 

In Section 11.6.2 we have referred to the generalized interaction, XY say, for two 
effects X and Y in the 2 n system. In the 3 n system we have for any two effects X 
and F, say, not only one but two generalized interactions, denoted by XY and XY 2 . 
Using the concept of the generalized interaction we can then also say that, for example, 
A is aliased with the generalized interactions of A and ABC, as given in (11.57). The 
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remaining aliases in (11.56) can be obtained in the same way. Finally, we note that 
together with ABC in (11.56) all possible effects for the 3 3 factorial (see Table 11.17) 
are accounted for in (11.55). □ 

The fractional factorial design we have discussed above is clearly of no practical 
value unless in addition to the assumption of zero 3-factor interaction we can also 
assume that all 2-factor interactions are zero or negligible. Then we can obtain infor¬ 
mation about main effects. This is another example of a main-effect plan or a resolution 
III design. 

11-9.6 Highly Fractionated 3 n Factorials 

The reader should have no difficulty extending the ideas of the previous section and 
consider, for example, a 1/3 fraction of the 3 4 factorial leading to a resolution IV design 
or a 1/3 fraction of the 3 5 factorial leading to a resolution V design, and so on. But even 
a 1/3 fraction of a 3 5 factorial may be impractical as it leads still to too many treatment 
combinations. The problem becomes even more critical for 3 n factorials with larger 
n. And it is not uncommon to have many factors in an experiment, in particular in 
an exploratory experiment. The need for highly fractionated factorials becomes then 
obvious, such as a 1/9 fraction of a 3 5 or 3 6 , or a 1/27 fraction of a 3 6 factorial, and so 
on. 

Designs of the form mentioned above can be developed by combining and extend¬ 
ing the ideas and rules given in Sections 11.7.4 and 11.9.5. In particular, the identity 
relationship now contains several interaction components; some are chosen indepen¬ 
dently and others represent the generalized interactions of those chosen interaction 
components. For example, for a 1/3 2 fraction we can choose two interactions freely, 
say X and Y, so that 

I = x = Y = XY = XY 2 

determines the treatment combinations to be included and also the alias structure. It is, 
of course, important to have an alias structure which allows us, under certain assump¬ 
tions, to estimate the effects and interactions in which we are interested. This is not 
always easy to do and care must be used to choose X and Y appropriately, or X, Y, Z, 
say, for a 1/3 3 fraction, etc. We shall not pursue this any further here, but some rules 
will be developed in Chapters 11.13 and 14, 

11.9.7 Systems of Confounding for Fractions of 3 n 
Factorials 

Even for a reasonable fractional factorial the number of treatment combinations may 
be too large for a suitable error-control design. For example, for the 1/3 fraction of the 
3 3 factorial we have 9 treatment combinations but the error-control design available 
may call for blocks of size 3. It becomes then necessary to use an incomplete block 
design and confound some effects with blocks, that is，use a system of confounding as 
described in Section 11.9.4. 
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To obtain a reasonable, that is, useful system of confounding we have to choose 
carefully the effect or effects to be confounded with blocks in order not to sacrifice 
needed information. To make that choice we have to consult the alias structure to see 
which effects can be estimated (if there were no blocking) and which of these are least 
important. These would be typically higher order interactions. Let us consider our 
example of Section 11.9.5 and suppose we have available blocks of size 3. We then 
need three blocks for a basic replicate. That means we need to confound 2 d.f. with 
blocks and since each effect in a 3 n factorial accounts for 2 d.f. we need to confound 
one effect with blocks. Inspection of the alias structure (11.55) shows that if we do not 
want to confound a main effect with blocks the only choice is to confound AB 2 (and 
its aliases) with blocks. Using the procedure of Section 11.9,4 we construct the blocks 
by finding the treatment combinations (among the 9 chosen for the fraction) satisfying 
the equations 


X!-h 2x2 = 0 ： 000 , 111， 222 
Xl ^r 2 x 2 = 1 ： 210,102,021 
+ 2x 2 = 2 : 120,201,012 

and assign them to blocks 1 ， 2, 3, respectively. Suppose we have r replications of this 
basic arrangement, that is, 3r blocks altogether. Then the structure of the ANOVA 
is as given in Table 11.19. The important point is that there are now only 6 d.f. for 
treatments which are partitioned into the main effects A, B, and C each with 2 d.f. 

This simple example should convey the general idea that the construction of sys¬ 
tems of confounding for fractional factorials follows the same rules as for full facto¬ 
rials. The effects to be confounded are obtained from the alias structure. With each 
effect its aliases are also confounded with blocks. And if several effects need to be 
confounded with blocks then all their generalized interactions are confounded with 
blocks too. This makes this process not always easy and as a consequence sometimes 
confounding of desirable effects cannot be avoided. Systems of partial confounding 
may be helpful. 

11.10 EXPERIMENTS WITH FACTORS 
AT TWO AND THREE LEVELS 

11.10.1 Asymmetrical Factorial Experiments 

So far we have discussed two extreme types of factorial experiments: On the one ex¬ 
treme we have n factors all with possibly different numbers of levels; on the other 
extreme we have n factors all having the same number of levels, for instance, 2 or 3. 
These two types are referred to as asymmetrical (mixed) factorials and symmetrical 
{pure) factorials, respectively. 

In practical applications special kinds of asymmetrical factorials are often used. We 
may have two or three groups of factors where all factors in the same group have the 
same number of levels. Of particular interest are 2 m x 3 n experiments, that is, m factors 
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Table 11.19 Outline of ANOVA for 1/3 Fraction of 
3 3 Factorial in Blocks of Size 3 


Source 

d.f. 

Blocks 

3r- 1 

A 

2 

B 

2 

C 

2 

Error 

6(r - 1) 

Total 

9r — 1 


with 2 levels each and n factors with 3 levels each. Their use has been advocated and 
promoted by Taguchi (1986 ， 1987) (see also Roy, 1990) in his parameter designs for 
off-line quality control. Of special importance in these applications are fractions of 
2 m x 3 n factorials (see Section II. 17.4.1. 

The construction of such fractions and of systems of confounding borrows heavily 
from the methods we have discussed in earlier sections. We shall give a few examples 
to illustrate the main ideas, but leave a more thorough discussion for Chapters 11.12 and 
13. — 


11.10.2 Confounding in 2 m x 3 n Factorials 

To use the methods of constructing systems of confounding described in Sections 11.6 
and 11.9.4 we need to confine ourselves to blocks of size 2 P x 3 q with p ^ m, q n. 
The general idea is to either combine a system of confounding for the 2 m factorial 
with the complete 3 n factorial, or a system of confounding for the 3 n factorial with the 
complete 2 m factorial or，as a third possibility, combine systems of confounding for 
both factorials. We shall illustrate this for the 2 2 x 3 2 factorial with blocks of size 18, 
12, 9, 6, and 4. 

Let us denote the treatment combinations by (xi, a ： 2 , ^i, 22 ) where X\, X 2 = 0,1 
represent the levels of the 2 2 factorial with factors A, B and zi, Z 2 = 0,1. 2 those of 
the 3 2 factorial with factors C, D. Further, let Si denote the ith set of treatment combi¬ 
nations for a system of confounding for the 2 2 factorial and Sj the jth set of a system 
of confounding for the 3 2 factorial. Combining sets Si and Sj in an appropriate way, 
referred to as a Kronecker product design ，constitutes then a system of confounding for 
the 2 2 x 3 2 factorial. These can be described briefly as follows. 
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Blocks of Size 18: 

Confounding AB with blocks of size 2 gives S\ = {(0, 0), (1, 1)}, S 2 = {(1, 0), (0, 
1)}. With S f ={( 0 , 0 ), ( 1 ， 0 )，（ 2 , 0 )，（ 0 , 1 )，（ 1 ， 1 )，（ 2 , 1 )，（ 0 , 2), (1，2)，（2, 2)} we 
then consider S\S f and S 2 S 1 . This means we adjoin every treatment combination in 
Si(i — 1,2) with every treatment combination in S\ giving us two sets of 18 treatment 
combinations (xi， 0 : 2 ,之 1 , 之 2 ). Each set represents a block. These two blocks form the 
basic arrangement which can then be replicated r times. Except for the interaction AB 
(1 d.f.) the main effects for A, B, C, D and all other interactions are estimable. 

An alternative to replicating the basic arrangement is to use partial confounding of 
A ， B ， and AB. Such a plan yields then partial information about these three effects and 
full information about all other effects. 

Blocks of Size 12: 

For this situation we generate three sets S f s by confounding, for example, CD. 

We then have S[ = {(0,0), (1,2), (2,1)}, 5^ = {(1,0)，(0,1)，(2, 2)}，and 苑 = 
{(2,0)，(0,2)，(1 ; 1)}. With S = {(0,0), (1,0)，(0,1)，(1,1)} we form SS^SS^SS^ 
which yields three blocks of size 12. This basic arrangement needs to be replicated r 
times. Alternatively, some system of partial confounding for the 3 2 factorial may be 
used so that information about all main effects and interactions may be obtained. 

Blocks of Size 9: 

The only design in this class is obtained by confounding A, B, and AB, that is, by 
forming Si = {(0,0)}, S 2 — {(1,0)}, S 3 = {(0,1)}, 5*4 = {(1,1)}. These sets are 
then combined with S f = {all treatment combinations for 3 2 factorial}. This arrange¬ 
ment is obviously of no practical value unless the 2 2 factorial itself is not important but 
only the 3 2 factorial and interactions between factors with 2 and 3 levels, for example, 
AxC,BxC,AxBxC, etc. 

Blocks of Size 6: 

This is the only situation where we combine systems of confounding for both the 2 2 and 
3 2 factorials. One possibility is to confound AB generating S\ = {( 0 , 0 ),(Ll)}and 
^2 = {(1,0), (0,1)}, and to confound CD, say, generating S[ = {(0,0). (1,2). (2.1)}， 
S 2 = {(1,0), (0,1), (2,2)} and S f 3 = {(2,0), (0, 2), (1,1)}. The six combinations 
SiS’j {i = 1,2; j = 1,2,3) then yield six blocks of size 6 . We should note here that in 
addition to AB and CD also the generalized interaction AB x CD is confounded with 
blocks. There exist, obviously, other possibilities of forming the Si and and various 
system of partial confounding can be used to obtain the desired amount of information 
about main effects and interactions. 

Blocks of Size 4: 

As with the case of blocks of size 9, this design is generally of no practical value as 
all effects of the 3 2 are confounded with blocks. We combine S\ = {(0,0), (1,0)， 
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000 
11 0 
10 1 
Oil 


For the 3 2 factorial we consider 1/3 fractions based on the identity relationship 


(0,1) ，（1 ,1)} with Sj (j = 1,2, … ， 9) where each 另 contains only one treatment 
combination from the 3 2 factorial. 

The method just described is obviously quite simple and can be extended easily 
to other factorials. It does, however, not always lead to the most practical or suitable 
arrangements. Using a different notion of partial confounding and employing other 
types of incomplete block designs we shall discuss other methods in Chapter 11.12 (for 
a listing of some useful designs see II. Appendix D). 

11.10.3 Fractions of 2 m x 3 n Factorials 

The idea of considering the symmetrical factorials separately and then adjoining treat¬ 
ment combinations from those factorials in an appropriate manner can also be used to 
construct useful fractions of asymmetrical factorials. Connor and Young (1961) have 
devised such a method for 2 m x 3 n factorials. Their designs are such that they al¬ 
low the estimation of all main effects and 2-factor interactions assuming that all other 
interactions are negligible. 

We shall give only one example here to illustrate the method and refer the reader to 
the catalog of designs provided by Connor and Young (1961) as reprinted in McLean 
and Anderson (1984). We consider a 1/2 fraction of the 2 3 x 3 2 factorial with factors A, 
B, C having 2 levels and D, E having 3 levels. To this end we consider a 1/2 fraction 
of the 2 3 factorial based on the identity relationship 



This leads to a partition of the 8 treatment combinations into two sets, Si and S 2 , ac¬ 
cording to the sign with which the treatment combinations into ABC (see Section 11,6) 
or, alternatively, according to whether they satisfy the equations 

51 : X 2 + xs = 0 mod 2 

or 

5 2 ' x\ X 2 + X 3 = l mod 2. 

We thus obtain Each set represents a 1/2 fraction of the 2 3 factorial 

5! S 2 


0101 

11 c _ u 11 


I = DE. 
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This leads to a partition into three sets 5(, 5^, S f 3 based on the equations 

S[ : zi Z 2 = 0 mod 3 

: 2i + :2 = 1 mod 3 
S f 3 : zi Z 2 = 2 mod 3, 

that is, 


S[ 

S' 2 


00 

10 

20 

1 2 

0 1 

02 

2 1 

22 

11 


The 1/2 fraction for the 2 3 x 3 2 factorial is then obtained by adjoining the two types of 
sets as follows: 

5 2 5^ 5 2 S^ 

Each set SiSj consists of 4 x 3 = 12 treatment combinations. The final design is given 
in Table 11.20. 

Since both 1/2 fractions of the 2 3 factorial and all three 1/3 fractions of the 3 2 fac¬ 
torial are used to obtain the final design it is possible to estimate all main effects and all 
2-factor interactions. As a consequence these types of designs are still quite large and 
other designs may have to be considered for practical applications. Of particular inter¬ 
est then are main effect plans as developed, for example, by Addelman and Kempthorne 
(1961) and Addelman (1962). Such methods are discussed in Chapter 11.14. 


Table 11.20 1/2 Fraction of 2 3 x 3 2 Factorial 


Si 5 ； S 2 S , 2 S 成 


00000 

00012 

00021 

11000 

11012 

11021 

10100 

10112 

10121 

01100 

01112 

01121 


10010 

10001 

10022 

01010 

01001 

01022 

00110 

00101 

00122 

11110 

11101 

11122 


10020 

10002 

10011 

01020 

01002 

01011 

00120 

00102 

00111 

11120 

11102 

mu 
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11.11 EXAMPLES USING SAS® 


Example 11.10: We consider here a purely numerical example to illustrate the roles 
of error-control, treatment and sampling design. We use a design of partial confounding 
of a 2 2 factorial in blocks of size 2 with subsampling (two observations per EU). The 
design and the data are given in Table 11.21b. 

We use both SAS PROC GLM and SAS PROC MIXED to analyze the data. The 
main reason for using PROC GLM is to obtain an ANOVA table. The results of both 
analyses are given in Table 11.21b, based on the input statements given in Table 11.21a. 

We make the following comments on the input and output: 

(i) In order to obtain the correct test for A, B, A ^ B have to specify in PROC 
GLM E = block * A * B as the correct error term, that is, the experimental error. 
In PROC MIXED this is done correctly automatically by declaring block* 4 * 丑 
as a random effect. 

(ii) The observational error variance component is estimated as = 1.2917 (in 
GLM in the basic ANOVA and in MIXED as Residual). 


(iii) The experimental error variance component is estimated in MIXED as Sf = 
block= 1.7292 We can obtain the same value in GLM from MS (block* 
A * S)as 


a 2 e = [MS (block * - MS(ERROR)]/2 

=(4.75 - 1.2917)/2 = 1.7292 


(iv) Both analyses produce the same results for testing hypotheses about A, B and 
A ^ B. 

(v) The estimates for A, B, and A * B ait obtained in MIXED by specifying the 
appropriate contrasts. Since all effects are confounded in one (out of three) repli¬ 
cates, they are all estimated with the same variance, namely, 

var(d) = var(B) = var(A * B) 

(J^ + nof 
r*n 


where r* is the effective number of replications and n is the size of the subsam- 
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pie. As 


i consequence, 
se{A )= 


se(B) 


^e(A * B) 

fMS(EE)\ 1/2 
\ r*n ) 

^MS(block*A*5)\ 1/2 


⑺ 1 


: 1.0897. 


(vi) The LS means (which are not available in GLM) can also be used to obtain the 
estimates for A, B, and A* B. 

(vii) The d.f. for testing hypotheses about A, B 9 and >1 * B are the d.f. for experimen¬ 
tal error. They are obtained as 

# of EUs - 轉 of treatments — # of blocks +1 

= 12 — 4 — 6 +1 

= 3 (see Table 9.13). □ 


Table 11.21 2 2 Factorial in Blocks of Size 2 


run; 

proc print data^factorial; 

title 1 ’DATA FOR 2**2 FACTORIAL'; 

title2 'IN INCOMPLETE BLOCKS'; 

title3 ’WITH SUBSAMPLING ，； 

run; 

proc glm data=factorial; 
class block A B; 

model y = block A B A*B block*A*B; 
testH = AB A*BE = block*A*B; 
titlel ’ANALYSIS OF 2**2 FACTORIAL’; 
run: 
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Table 11.21 (Continued) 

proc mixed data=factorial; 
class block A B; 
model y = block A B A*B; 
random block* A*B; 
lsmeans A B A*B; 
estimate ’A’ A -1 1; 
estimate ’B’ B -1 1; 

estimate ’A*B’ A*B -111 -l/divisor=2; 
run; 

b.) Output: 


DATA FOR 2**2 FACTORIAL 
IN INCOMPLETE BLOCKS 
WITH SUBSAMPLING 

Obs block A 

110 
2 10 

3 11 

4 11 

5 2 1 

6 2 1 

7 2 0 

8 2 0 

9 3 1 

10 3 1 

11 3 0 

12 3 0 

13 4 0 

14 4 0 

15 4 1 

16 4 1 

17 5 0 

18 5 0 

19 5 0 

20 5 0 

21 6 1 

22 6 1 

23 6 1 

24 6 1 


ANALYSIS OF 2**2 FACTORIAL 
The GLK Procedure 
Class Level Information 
Levels Values 

6 1 2 3 4 5 6 

2 0 1 

2 0 1 


Class 

block 

A 

3 



Number of Observations Read 
Number of Observations Used 
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Table 11.21 (Continued) 


Dependent Variable : y 


Source 

DF 

Sum of 
Squares 

Mean Square 

F Value 

Pr > F 

Model 

11 

171.1250000 

15.5568182 

12.04 

<.0001 

Error 

12 

15.5000000 

1.2916667 



Corrected Total 

23 

186.6250000 





R-Square Coeff Var Root MSE y Mean 

0.916946 14.43194 1.136515 7.875000 


Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

block 

5 

108.3750000 

21 . 6750C00 

16.78 

<.0001 

A 

1 

4.0000000 

4 .OOOCOOC 

3.10 

0.1039 

B 

1 

42.2500000 

42.250000C 

32.71 

<.0001 

A^B 

1 

2.2500000 

2.2500000 

1.74 

0.2115 

block*A^B 

3 

14.25000C0 

4.7500000 

3.68 

0.043 6 


Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

block 

5 

31.41666667 

6.28333333 

4.86 

0.0116 

A 

1 

4.00000000 

4.00000000 

3.10 

0.1039 

B 

1 

42.25000000 

42.25000000 

32.71 

〈 •C001 

AxB 

1 

2.25000000 

2.2500000C 

1.74 

0.2115 

block^A*B 

3 

14.25000000 

4.75000000 

3.68 

0.C436 


Tests of Hypotheses Using the Type III MS for block*A*B as an Error Term 


Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

A 

1 

4.00000000 

4.00000000 

0.84 

0.4265 

B 

1 

42.25000000 

42.25000000 

8.89 

0.0585 

A*3 

1 

2.25000000 

2.25000000 

0.47 

0.5407 


The Mixed Procedure 
Model Information 


Data Set 


WORK.FACTORIAL 


Dependent Variable 
Covariance Structure 
Estimation Xerhod 
Residual Variance Method 
Fixed Effects SE Method 


y 

Variance Components 

REML 

Profile 

Model-Based 


Degrees of Freedom Method Containment 
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Table 11.21 {Continued) 


Iteration History 

Evaluations -2 Res Log Like Criterion 

1 63.93019485 

I 61.40414523 0.00000000 

Convergence criteria met. 


ANALYSIS OF 2**2 FACTORIAL 

The Mixed Procedure 

Covariance Parameter 
Estimates 

Cov Parm Estimate 

block*A*B 1.7292 

Residual 1.2917 


Type 3 Tests of Fixed Effects 


Effect 

Wum 

DF 

Den 

DF 

F Value 

Pr > F 

block 

5 

3 

1.32 

0.4353 

A 

1 

3 

0.84 

0.4265 

B 

1 

3 

8.89 

0.0585 

A*B 

1 

3 

0.47 

0.5407 


Estimates 


Label 

Estimate 

Standard 

Error 

DF 

t Value 

Pr > |tI 

A 

1.0000 

1.0897 

3 

0.92 

0.4265 

B 

3.2500 

1.0897 

3 

2.98 

0.0585 

A* B 

0.7500 

1.0897 

3 

0.69 

0.5407 


Least Squares Means 
Standard 

Effect A 3 Estimate Error DF t Value Pr > It I 

A 0 7.3750 0.7034 3 10.48 0.0019 

A 1 8.3750 0.7034 3 11.91 0.0013 

B 0 6.2500 C.7034 3 8.89 0.0030 

B 1 9.5000 0.7034 3 13.51 0.0009 

A*B 0 0 5.3750 1.0433 3 5.15 0.0142 

A*B 0 1 9.3750 1.0433 3 8.99 0.0029 

A*B 1 0 7.1250 1.0433 3 6.83 0.0064 

A*3 1 1 9.6250 1,0433 3 9.23 0.0027 
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EXAMPLE 11.11: Using SAS PROC FACTEX we generate the 2 3 factorial design 
in blocks of size 4 as given in Section 11.6.1 The input statements are given in Table 
11.22a. We make the following comments: 

(i) The “factors” statement gives the names of the factors. The default is that these 
factors have 2 levels. 

(ii) The “blocks” input specifies the size of the blocks. 

(iii) The “model” statement indicates which effects and interactions we want to esti¬ 
mate; in this example we specify the main effects and two-factor interactions as 
indicated. If we specify too many effects and interactions no suitable system of 
confounding may exist. 

The output is given in Table 11.22b: 

(iv) The design is given in two forms, first in the standard order of the treatment com¬ 
binations, second (because of the output statement) in the order of the blocks. We 
note here that the levels of the factors are labeled as —1 and 1, which corresponds 
to our notation of 0 and 1. 

(v) The “Block Pseudo-factor Confounding Rules” gives in general the names of the 

interactions which were chosen to generate the system of confounding. In our 
example it is the three-factor interaction ABC. □ 


Example 11.12: We use SAS PROC FACTEX to generate a system of confounding 
for the 2 5 factorial in blocks of size 8. The input statements and the output are given in 
Table 11.23a, b, respectively: 

(i) We denote the factors by /i,../s ， 

(ii) We want to estimate all main effects and two-factor interactions. 

(iii) The interactions to be confounded with blocks are given as /2 * /3 * /4 * h and 

/i * /4 * / 5 , and consequently their generalized interaction /i * /2 * / 3 , thus 
achieving the stated objective. □ 


Example 11.13: Here we use SAS PROC FACTEX to generate a fractional facto¬ 
rial design. Specifically, we consider the 1/2 fraction of the 2 3 factorial (see Section 
11.7.2). We comment briefly on the input statements and the output given in Table 
11.24: ^ 

(i) We have given two equivalent input statements which will generate the same 
design: 
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Table 11.22 2 3 Factorial Design in Blocks of 4 


a) Input statements: 

proc factex; 
factors ABC; 
blocks size=4; 

model estimate=(A[B|C @2); 
examine design confounding; 
output out=design blockname=block nvals=(l 2); 
title ’2 料 3 FACTORIAL IN BLOCKS OF SIZE 4’; 

run; 

proc print data=design; 
run; 

b.) Output: 


2**3 FACTORIAL IN BLOCKS OF SIZE 4 
The FACTEX Procedure 
Design Points 

Experiment 

Number ABC Block 


1 

2 

2 

1 

2 

1 

1 

2 


Block Pseudo-factor Confounding Rules 
[Bl] = A*B*C 

2**3 FACTORIAL IN BLOCKS OF SIZE 4 
Obs block. ABC 
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BIock Pseudo-fact or Conroundmg Ru^es 


[31] 

[B21 


f2*f3^f4*f5 


Table 11.23 2° Factorial Design in Blocks of 4 


a) Input statements: 

proc factex; 
factors fl-f5; 
blocks size=B; 

model estimate=(f 11f21D|f41f5 @2); 
examine design confounding; 

title ’2**5 FACTORIAL DESIGN IN BLOCKS OF SIZE 8 ’； 
run; 

b) Output: 


2**5 FACTORIAL DESIGN IN BLOCKS OF SIZE 8 
The FACTEX Procedure 
Design Points 

Experiment 

Number fl f2 f3 f4 f5 Block 


2332 1441 1441 2 33241143223322 3 41 - 丄 4 


1 * Ti IX Tx II- 


T- IX 1 i Ti 1 * Ti -IX 


I i ― ^ 1 1 f —- 1 一 ― I 一 ― I < — I I — I t — 11 i — I I — I * - 

_ - _ - - _ I 


I 7i 1 l'l tl1i 1 - 


i — I I — I I ― I < ― I i — II * — I -— I 一 —- 1 1 t ― I f ― I I —- I — I < — I r I < — I tx 

---_ - - - _ - - I - 


1.1 1 - - - 1 - 


I — I t-H < ―- IX IX IX i — * IX 1 

-------- 


I -— I - 1丄 i ― I * ― I r - - ^I -i1 -• X 

-------- 


一丄 1 ― * 1 — I ^I -― I f — I * — I 1 —- f — I I — I l —- -—- I―II * —I 

---------------- 
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Table 11.24 1/2 Fraction of 2 3 Factorial 


a) Input statements: 


proc factex; 

factors ABC; 

size design=4; 

model estimate=(A B C); 

examine design aliasing confounding; 

title ’1/2 FRACTION OF 2**3 FACTORIAL’: 

run; 

proc factex; 

factors ABC; 
size fraction=2; 
model res=3; 

run; 

b.) Output: 


1/2 FRACTION OF 2*^3 FACTORIAL 
The FACTEX Procedure 
Design Points 

Experiment 

Number ABC 


2 -： 

3 1 

4 1 


Factor Confounding Rules 
C = A*B 



Aliasing Structure 


A = B*C 
E = A*C 
C = A*B 
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The size of the design can be stated either as “size design = # of design points 
(runs)” or as “size fraction = denominator of fraction ”， 

The model statement can be given either in terms of effects and interactions to 
be estimated or in terms of the resolution of the design (see Section 11.7.4). 

(ii) The design points are given in Table 11.24b. 

(iii) The “Factor Confounding Rules” gives an expression equivalent to the defining 
relationship (see (11.35). The expression 

C = B 

can be interpreted as the method of constructing the design: Starting with the 
full 2 2 factorial with factors A and B the levels of factor C are obtained by 
multiplying for each run the corresponding levels of A and B. 


(iv) The “Aliasing Structure” is obtained as explained in Section 11.7.3. □ 


Example 11.14: In this example we use SAS PROC FACTEX to combine fraction¬ 
ation and confounding. Specifically, we consider the 1/16 fraction of the 2 s factorial 
in blocks of size 8 in the form of a resolution IV design. 

The input statements and the output are given in Table 11.25: 

(i) The 16 design points are given in Table 11.25b, assigning them to two blocks. 

(ii) The factor confounding rules specifies four interactions in the defining relation¬ 
ship, each consisting of four factors, that is, 

I = /2 * /3 * /4 * /s = /l * /3 * /4 * /6 
=/l * /2 * /4 * A = /l * /2 * /3 * fs 

to which should be added all their generalized interactions. 

(iii) The block pseudo-factor rule specifies one (“estimable” ） four-factor interaction 
to generate the system of confounding. 

(iv) The alias structure indicates that all main effects are estimable (assuming that 
three-factor interactions are negligible) and which two-factor interactions are 
aliased with each other. 


(v) The two-factor interactions indicated by [B] are confounded with blocks since 
they are aliased with /i * /2 * /3 * A in [Bl]. □ 
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Factor Confounding Rules 

f5 = f2*f3*f4 
f6 = fl*f3*f4 
f7 = fl*f2*f4 
f8 = fl*f2*f3 


Block Pseudo-factor Confounding Rules 


Table 11.25 Fractional Replication with Confounding 


a) Input statements: 

proc factex; 

factors fl-f8; 
size design= 16; 
blocks size=8; 
model res=4; 

examine design aliasing confounding; 

title 1 ’ 1/16 FRACTION OF 2**8 FACTORIAL，; 

title2 ’IN BLOCKS OF SIZE 8 ’； 

run; 

b.) Output: 


1/16 FRACTION OF 2**8 FACTORIAL 
IN BLOCKS OF SIZE 8 

The FACTEX Procedure 

Design Points 

Experiment 

Number f1 f2 f3 f4 f5 f6 f7 f8 Block 


* 1 I ― 1 I — I V — I 1 ― I I ― I 1 ― I 1 ― I 1 — i I ― I I ― I V ― _ 1 1 

- - -111 

* 1 1 1 ― II I — I f — I I — I I ― II- 1 - ^ 一 - ^I «__I 一 
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11111111111111 

- -- -- -I 

1 rxrH 1 1 1 -丄 rJ.1 il II 1 1-1 rll 

--- ---- 

1111111111111- 

------- 


10 12 3 4 5 6 

一 ^ 一 _^- -^ - - _ - 1 _ | -I 


[Bl] = fl*f2*f3*f4 
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Table 11.25 (Continued) 


Aliasing Structure 


[B ； 


fl 

f2 

f3 

f4 

f5 


fl 

f8 

fl*f2 = f3*fS = 
f1*f3 = f2*f8 = 
fl*f4 = f2*f7 = 
=flxf5 = f2*f6 = 
fl*f6 = f2*f5 = 
fl*f7 = f2*f4 = 
fi*f8 = f2*f3 = 


f4xf7 = 
f4xf6 = 
f3*f6 = 
f3*f7 = 
f3*f4 = 
f3*f5 = 
f4*f5 = 


f5*f6 

f5*f7 

f5*f8 

f4*f8 

f7*f8 

f6*f8 

f6^f7 


11.12 EXERCISES 

11.1 Show that for the 2 4 factorial the main effects and interactions represent a com¬ 
plete set of orthogonal contrasts among the 16 treatment combinations. 

11.2 Consider a 2 2 factorial experiment in a randomized complete block design with 
b blocks. 


(i) Define, in terms of the true treatment effects, the main effects and two- 
factor interaction. 

(ii) Show that the main effects and the two-factor interaction are orthogonal 
contrasts among the treatment effects. 

(iii) Suppose each experimental unit has 3 observational units. Give an expres¬ 
sion for the variance of the estimators for the main effects and interaction. 

(iv) Outline the ANOVA table for the design with b blocks and 3 observational 
units per experimental unit giving source of variation, d.f” JS(MS), and the 
F -ratios for testing hypotheses about the main effects and interaction. 


11.3 Consider the following block designs with 5 blocks and with treatments having 
a 2 2 factorial structure (with factors A and B, say): 


(a) randomized complete block design with 2 samples per EU, and 2 measure¬ 
ments per sample; 

(b) generalized randomized block design with 4 replications for each treatment 
per block; 

(c) generalized randomized block design with 2 replications for each treatment 
per block, and 2 observations per EU. 


For each design: 
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(ii) Write out an appropriate linear model. 

(ii) Outline the ANOVA table, giving sources of variation, d.f. (for the d.f. give 
numbers not formulas), and E(MS). 

(iii) Indicat how you would test for main effects and interactions. 

(iv) Give the variance for A and give its estimator, that is, var(A). 

11.4 Consider a 2 2 factorial experiment in a completely randomized design with r 
replications for each treatment combination. Suppose that for each observation, 
y, information on a covariate, x, is available. 


(i) Using the supplementary information, give the general expression for the 
(adjusted) estimator for the main effect A. 

(ii) Let A yy , B yy , (AB) yy and E yy be the sums of squares for A, B, AB 
and Error, respectively, in the ANOVA table without the covariate. Using 
similar notation, give general expressions for the corresponding sums of 
squares when the covariate is included in the analysis. 


11.5 Suppose you are consulted to help design an esperiment involving two factors 
at two levels each. A sufficient number of blocks of size two are available for 
the experiment. The investigator wishes to obtain equal information on the main 
effects and the two-factor interaction. 


(i) Give the name of the method used for constructing a suitable experimental 
design. 

(ii) Write out explicitly the design for this study and explain how you obtained 
it. 

(iii) Outline the ANOVA table for the design given in (ii), including source of 
variation, d.f., and sums of squares. 

(iv) Suppose only 6 blocks of size 2 are available for the experiment. What 
method could one use to construct the design. Explain and give the design. 
What kind of design is this? 


11.6 A horticultural experiment conducted in a green house was laid out as a Latin 
square design, where the blocking factors represent temperature and light inten¬ 
sity, respectively. The treatments have a 2 2 factorial structure, that is, 2 factors 
A and B each at 2 levels. The layout of the design and the results from the ex¬ 
periment (in parentheses) are given below: 
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Temperature 

Light Intensity 

1 

2 

3 

4 

1 

dob 。 

aih 

a 0 bi 

ai&o 


(5) 

( 10 ) 

( 8 ) 

(7) 

2 

aibi 

aib 0 

do 60 

a 0 bi 


( 12 ) 

⑻ 

( 6 ) 

( 10 ) 

3 

aib 0 

a 0 bi 

a 】 6 i 

a 。60 


( 10 ) 

⑻ 

(15) 

(7) 

4 

0-obi 

0-01>0 


aih 


(9) 

(9) 

( 11 ) 

(16) 


(i) Give a linear model for analyzing the data from this experiment and sketch 
the ANOVA giving sources of variation and d.f. 

(ii) Obtain the ANOVA table. 

(iii) Give a numerical expression for the estimate of the interaction Ax B. 

(iv) The experiment is to be repeated at different times，so that in the end data 
from 3 different times, Ti, T 2 , say, will be available (the experiment 
above represents Ti). Even though the temperature and light intensity 
trends remain, they may be assumed to differ from one time period to the 
next. It is expected that there is interaction between the time factor and the 
treatment factors. 

Give a linear model for data from this experiment and sketch the ANOVA 
table, giving sources of variation and d.f. 

(v) For the experiment described in (iv) what is the variance of the estimated 
main effects, A, B, and interaction, ABl 

11.7 Suppose a dermatologist wants to study the effectiveness of two (2) different 
preparations of a skin lotion using two ( 2 ) different forms of application (for 
example, one vs. two applications per day). He has available 12 patients with 
a certain skin disease and he can apply one form of medication (that is, combi¬ 
nation of preparation and frequency of application) to each arm of each patient. 
Even though the patients have the same disease, there exists considerable varia¬ 
tion among them, but the two arms of a patient are quite homogeneous. 

(i) What type of experimental design would be appropriate for this study? 

(ii) What are the experimental units? 

(iii) Give a suitable experimental plan for this study and describe how you ob¬ 
tained this plan. 

(iv) For the design given in (iii), outline the ANOVA table, giving sources of 
variation, d.f., and sums of squares. 
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(v) For the plan given in (iii), what is the variance of the estimates of the main 
effects and the interaction? 

11.8 Consider a 2 6 factorial experiment and suppose that the experimenter has only 
enough resources to handle just a fraction of all possible treatment combinations. 
Let this fraction be chosen by the identify relationship 

/ = +ABCD = +CDEF = +ABEF 

(i) Give the treatment combinations that make up this fraction. 

(ii) Assuming that all interactions involving 3 or more factors and all 2-factor 
interactions not involving factor B can be estimated from this fractional 
factorial. 

(iii) Suppose we have 2 replications (that is, 2 EUs) for each treatment com¬ 
bination in a CRD; outline the ANOVA table (giving sources of variation, 
d.f. and E(MS)) based on the assumptions given in (ii). 

(iv) Suppose we need to use blocks of size 8 and we have 4 blocks available; 
under the assumptions in (ii) give a suitable arrangement without sacrific¬ 
ing information about the main effects and 2-factor interactions involving 
factor B. 

(y) For the design in (iv) outline the ANOVA table, giving sources of variation 
and d.f. 

(vi) Describe how you would obtain the ANOVA table in (v) with SAS. 

11.9 Use SAS PROC FACTEX to construct designs equivalent to those given in (i) 
and (iv) in Exercise 11.8. 

11.10 Show that for the 3 3 factorial contrasts belonging to A and ABC 2 are orthogonal 
to each other. 

11.11 Construct a system of partial confounding for the 3 2 factorial in blocks of size 
3 with 6 blocks so that at least partial information can be obtained about the 
2-factor interaction components and full information about the main effects. 

11.12 Obtain the treatment combinations and the alias structure for a 1/3 fraction of the 
3 4 factorial (assuming that 3 - and 4-factor interactions are negligible). 

11.13 For the fraction obtained in Exercise 11.12, obtain a system of confounding using 
blocks of size 9. State what assumptions need to be made for this design to be 
useful. 

11.14 Consider the 2 3 x 3 2 factorial. Obtain a suitable system of confounding for 
blocks of size 6. 
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CHAPTER 12 


Response Surface Designs 


12.1 INTRODUCTION 

We have mentioned earlier that much of the topic on experimental design, and cer¬ 
tainly most of this book，is concerned with what we call comparative experiments. The 
emphasis and, in fact, the whole purpose of the experiment here is the comparison of 
treatments. We have explored this topic in detail in Chapter 7 for the CRD, with obvi¬ 
ous extensions to other error-control designs. In our discussion we have distinguished 
between qualitative and quantitative treatments, but in both cases the aim has been 
the same: to detect structure of some form among the treatment effects. In the case 
of quantitative treatments this can be done by using methods of regression analysis. 
If, for example, a straight line (with a nonzero slope) can be fitted to characterize the 
dependence of the estimated treatment effects on the treatments then this tells us not 
only that the treatment effects are different from each other, but also that there exists a 
simple relationship among them. 

More generally, the dependence of treatment effects on treatments can be rep¬ 
resented as a response curve (if the treatments are represented by the levels of one 
treatment factor, for example, amount of fertilizer) or a response surface (if the treat¬ 
ments are level combinations of two or more treatment factors, for example, amount 
of fertilizer and rate of application). And such curves or surfaces can be used to make 
judgments not only about treatment structure but also about the relationship between 
treatments and responses, or between input variables and output variables. Knowledge 
of this relationship is important if one wants, for example, to find the treatment combi¬ 
nation which gives the optimal (highest or lowest) response. We shall never know the 
exact relationship but we can try to approximate it. This is done, often sequentially, 
by using methods of experimental design and regression analysis. Methods that are 
directed towards this kind of investigation, using tools from experimental design and 
regression analysis, are usually referred to as response surface methodology (RSM). 

RSM was developed mainly with a view towards industrial experimentation and 
production (see Box and Wilson, 1951) but it has found application also in agriculture 
(see Mead and Pike, 1975)，in medical settings (see Carter, Wampler, and Stablein, 
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1983), and more recently in connection with off-line quality control (see Vining and 
Myers, 1990). And even though RSM has proven to be useful in practice, it suffers 
from a serious defect, namely that the form of the response surface depends on the 
choice of units for the input variables. To illustrate this point we consider a simple 
example. The relationship y = x\-\- X 2 can be pictured as a two-dimensional surface 
in a three-dimensional space giving the dependence of y on x\ and If we change the 
units for the input variables to x\ = 2 x\ and = 3 x2, then the relationship becomes 

V = + ~x\ 2 . 

While y is constant on the curves x\-\-X 2 — constant, that is, on circles in the {x\,X 2 ) 
-plane, it is constant on the curves — constant, that is, on ellipses in the 

x^-plane. Obviously, these surfaces are quite different from each other illustrating 
the point that there is a surface only with a choice of units of plotting, a point that must 
be kept in mind in the following discussion. 

12.2 FORMULATION OF THE PROBLEM 

Suppose we have k quantitative factors Fi ， F< 2 ,.Fk which are known or suspected 
to have an effect on a particular response. Each factor has continuous levels within a 
certain interval; for example, F{ has levels Xi with < Xi < Xm (i = 1 . 2 , ..., k). 
The hypercube {XiL < Xi < Xm ： % = 1,2,...,^:} contains the so-called operational 
region (OR) in which every level combination (Xi . X 2 ...., X^) is a feasible operating 
condition. We assume that each such setting can be controlled (essentially without 
error) by the experimenter. To each setting (Xi,X 2 ,... ，為 ） belongs a response, 77 , 
which is some function of the levels, that is, 

"= 0 ( X1 , X2 ,... ，為 ; H …為)， (12.1) 

where H … ， 0q are parameters. We write (12.1) for short as 

r? = 0 (X; 0 ) ( 12 . 2 ) 

with X = (Xi, X 2 ,...,Xfc) / and 6 = ( 沒 1 ，汐 2 , ... ，汐 g)’. Now both the true yield, 
rj = 77 (X 1 , X 2 ,..., X/c), at any given point in OR and the form of the functional 
relationship (p are unknown. Instead, we will have available only observed responses 
y = y(X) and we shall attempt to approximate 0(X, 0) by a polynomial function 
/(X,y9) in X. We then consider in place of (12.2) a model of the form 

y(X) = /(X./3) + e(X), (12.3) 

where f3 = {H . ■., ffmY are unknown parameters and e(X) represents error. 

Ideally we would like to have y(X) available for a sufficiently fine grid in OR in 
order to approximate 4>, or rather a realization of 0, sufficiently well. From a practical 
point of view this is clearly impossible. Instead we will be restricted to a relatively 
small number of points (these are sometimes referred to as runs or experiments) which 
will typically be confined to a region which is called the experimental region (ER) or 
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region of interest. Obviously, ER is contained in OR. The basic idea, due to Box and 
Wilson (1951), here is then the following: Based on our limited knowledge about the 
process under study we choose an ER. We assume that the response surface for ER 
is sufficiently smooth and hence can be approximated by a low-order polynomial, for 
example, of first or second degree. We then choose an appropriate treatment and error- 
control design to estimate the coefficients of the polynomial. From this we predict 
the response for any point in ER. If one of these points attains the optimal response 
then presumably our goal is achieved (we may, however, have reached a local optimum 
rather than a global optimum); if the fitted response surface indicates that the optimum 
may be outside ER then we would have to choose a new ER and repeat the whole 
process until the (predicted) optimum can be located. 

As Box and Wilson (1951) point out, the procedure described above leads to two 
sources of error: (i) experimental and sampling error in estimating the function /(X; /3) 
of (12.3) and (ii) bias due to the inadequacy of /(X;/3) approximating 0(X; 6) of 
(12.2). To minimize these errors, singly or jointly, is essentially the focus of response 
surface designs. To this end, Box and Hunter (1957), suggested the following basic 
requirements for such designs: 

(i) Assuming that a polynomial /(X;/3) of degree d approximates 0(X; 9) suffi¬ 
ciently well, the design should allow /(X; /3) to be estimated with satisfactory 
precision in ER. 


(ii) The design should allow to check whether the chosen /(X; /3) provides a satis¬ 
factory fit to the response surface or whether a different polynomial maybe more 
appropriate. 

(iii) The design should not contain an excessively large number of experimental 
points. 

(iv) The design should lend itself to adequate blocking of the experimental points. 


(y) One should be able to amend the design in case the polynomial of degree d 
proves to be inadequate and a polynomial of degree d + 1 needs to be fitted. 


These requirements were refined and expanded by Box (1968) and Box and Draper 
(1975) (see also Box and Draper, 1987). We shall not go into the details here but rather 
concentrate in the following on the five fundamental points above. 

In the following we shall describe the basic tools and designs of RSM and point out 
connections to treatment and error-control designs discussed elsewhere in this book. 
For details and further developments of RSM we refer the reader to specialized texts 
on this subject, for example, Box and Draper (1987), Khuri and Cornell (1996)，Myers 
and Montgomery (2002). 
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12.3 FIRST-ORDER MODELS 
AND DESIGNS 


12.3.1 First-Order Regression Model 


Within a small region it is often not unreasonable to approximate the response surface, 
that is, the function 0, by a first-order polynomial in the k input variables Xi : X 2 ,... ,Xk ： 

k 

y = 0 q 〉: PiXi + e. (12.4) 


In this model the regression coefficient 爲 is a measure of the change in the response 
y due to a change in the input variable Xi. This is, of course, the kind of information 
provided by the main effects from a factorial experiment where each factor has two 
levels (see Section 11.3). A natural choice of a response surface design for this situation 
is therefore a 2 k factorial, or a fraction of it. 

Suppose then we have 2 k experimental points (X 1 .X 2 , … ， Xk)j say, with j = 1 
2, … ， 2 k . With each level combination being replicated r times in a CRD we have 
N = r2 k experimental runs. Denote the low and high level of the ith factor by Xio 
and Xu, respectively. If we use instead of Xi the coded levels 


Xi 


Xi-X 


— ^io) 


then the low and high levels become xiq 
(12.4) now as 


(12.5) 

1 and Xu = 1, respectively. We rewrite 


k 

y(xi,x 2 ,.. ■ ， x k )i = 00 + y^S^Xj + e(xi,x 2 ,.... x k )i, (12.6) 

i~l 

where X{ — 土 1， or in matrix notation as 


y = (3.D)/T + e ， 

where y is the N x 1 vector of observations, D is an N x 1 vector of unity elements, D 
is the N x k design-model matrix of —Vs and l’s，= (H 的， ..., 與 )’， and e 
is the N x 1 vector of errors. More specifically, if we write 

D = (di ， d 2 , … ， dfc) 

as k N x l column vectors, we know that each has ■r2 k ~ 1 elements equal to —1 and 
r2 fc_1 elements equal to 1. This implies that 3’d< = 0 for every i. Moreover, we have 
d-d^/ = 0 for every z, i' with i ^ that is, the d/s are orthogonal to each other. 


12.3.2 Least Squares Analysis 

Using the properties given above, the normal equations for the of (12.6), 

(3，卿，_* = (3斯 
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simplify to 


It then follows that 


⑵. 


^ ^y{xi,X 2 ,.--,Xk)l 


X 1.3^2 j •• • fc 


= ― [(sum of all observations with Xi = 1) 

— (sum of all observations with = —1]. (12.8) 

We note that 念 is half the corresponding main effect given in Section 11.3. It follows 
further from (12.7) that 

var(/5*) = ( 12 . 9 ) 

and 八 

cov(/3*. /?*/) = 0. 

Hence, for any given point z = {zi ， Z 2 , ..., Zk)' in the ER given by {-1 < < 1; 2 = 

1 ， 2,… ， fc} we obtain the predicted response 


y(z) = d* 0 +Y,3；Zi 


( 12 . 10 ) 


var[y(z)] 


( 12 . 11 ) 


In order to evaluate which factors are influential and to investigate the response 
surface [as given by the y(z)] in more detail we need to obtain an estimate of This 
is achieved, as usual, through the AN OVA as given in Table 12.1，where 


y(xi,x 2 ^^,x k )i 


CCl ,X2 ，… ：^fe 


SS(Total) - 


D 2 = Di —SS(PE) 
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Table 12.1 ANOVA for First-Order Response Surface Design 


Source 

d.f. 

SS 

Regression 

pi 

k 

1 

A r (4 r ) 2 

P*2 

1 

N(k) 2 

Pk 

1 

_ 2 

Error 

r2 k -k-l 

D x = SS{E) 

Lack-of-fit error 

2 k -k-1 

D 2 = SS(LOF) 

Pure error 

2 k (r - 1) 

E E (2/( x )； - v( x )-) 2 = ss(PE) 

X l 

Total 

N -1 

EU( x )i — 5(.).) 2 

x.l 


We mention here that SS(E) consists of two parts, the usual error sum of squares 
for a CRD, denoted here by SS(PE), that is, sum of squares for pure error, and the 
sum of the sums of squares for all interactions for the 2 k factorial denoted here by 
SS(LOF). As in regression analysis this sum of squares can be used to test whether 
the postulated model (12.6) provides a sufficiently good enough fit to the data, a point 
to which we shall return later. To test whether the ith factor contributes to explaining 
the response we use the F-test 


= SS(^) 
- MS(E) 


(i = 1,2,... .A:) 


with 1 and v = N-k-l d.f. Suppose we consider, without loss of generality, only the 
first k\ factors to be important. We may then reconsider model (12.6) and use instead 


and 


with 


ki 

y(xi,x 2 ,...,x k )i = Po+Yl 3 i Xi + 伞 1 ，工 2 ，., -^k)l 


y(z) = 00 + 5Z . 念巧 


#(z)l 


N 



MS(E). 


( 12 . 12 ) 


(12.13) 


(12.14) 
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We may then compare the responses for two different sets of input variables, say z = 
{zi,z 2 , ...,z ki y and w = (u’i, 切 2 ,… , w ki y，by considering 

ki 

- y(w) = -Wi) 

i=l 

and 

1 fcl 

var[y(z) - y{w)} ^ — - Wi) 2 MS{E) 

i=\ 

Similarly, we may consider differences in responses if some of the input variables 
are kept constant at a desired level and the remaining input variables are varied to 
achieve optimum response if indeed it can be achieved in ER. Due to the fact that we 
are approximating the true response surface and due to experimental error there may, 
of course, not exist a single level combination which achieves the optimum response 
but rather the estimated responses in the neighborhood of an optimum may not be 
significantly different from each other. 

12.3,3 Alternative Designs 

The use of a full 2 k factorial to estimate the parameters of a first-order response surface 
will usually be wasteful, especially if it is used in a CRD with r replications for each 
level combination. There are basically two ways to reduce the number of experimental 
points. One way is to replicate each design point {x\,X 2 ^ ..., Xk) only once, that is, 
r = 1. In that case we have SS(PE) = 0 and SS(E) = SS(LOF) (see Table 12.1). 
Another way is to use only a fraction of a 2^ factorial (see Section 11.7) either as a 
single replicate or as a CRD with r > 1 replications. In either case we need to choose 
a fraction such that all k main effects are estimable and that sufficient d.f. for error 
will be available so that comparisons of the type (12.15) can be made with satisfactory 
statistical power as measured by the variance (12.16). This means that if we were 
to choose a very small fraction, such as a resolution III fractional factorial, we need 
several replications for each design point. Even if we were to choose a fractional 
factorial of resolution IV or V we may need some replication. Methods for constructing 
fractional factorials are discussed in Chapters 11.13 and 14. 

An important property of a 2 k factorial is that blocking can be accommodated eas¬ 
ily without sacrificing estimation of the main effects, that is, the Such blocking 
may become necessary for a number of reasons, mainly determined by practical and 
experimental considerations. For example, it may not be possible to complete all exper¬ 
imental runs with one batch of raw material and one suspects systematic batch-to-batch 
variation. For a full factorial appropriate blocks can be obtained (by using the methods 
indicated in Section 11.6 and more fully developed in Chapter II.8) as long as the block 
size ， 2、 is larger than the number of factors, k, for example, a 2 3 in blocks of size 4, 
a 2 4 in blocks of size 8, a 2 5 in blocks of size 8 or 16, and so on. Similar blocking 
arrangements can also be constructed for fractions of a 2^ factorial, for example, a 1/2 
fraction of a 2 5 in blocks of size 8, a 1/4 fraction of a 2 6 in blocks of size 8, and so on. 


(12.15) 


(12.16) 
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An alternative to the factorial designs described above is a class of designs referred 
to as simplex designs (Box, 1952). These are orthogonal designs, that is, the columns 
of the design-model matrix D satisfy d-d^ = 0 for i ^ with /c + 1 design points, 
which are then replicated r times in a CRD or a RCBD. The design points are located 
at the vertices of a regular fc-dimensional simplex, which for fc = 2 is an equilateral 
triangle, for A: = 3 is a tetrahedron, and so on. For r = 1 the matrix D can in general 
be written as (see Khuri and Cornell, 1996) 


D 


，— ai — —0.2, … ~^k\ 

d\ — ct2 一 叩 ... 一 Gfc 

0 2a 2 -^3 : 


0 


\ 0 


0 


~0-k 

kak/ 


where a ^ = Ci[(k + l)/z(?' + l)] 1 〆 2 and c?: are scaling factors (it is common practice to 
choose Ci = c for every i). Since the simplex design contains only fc + 1 experimental 
points to estimate fc + 1 regression coefficients it has zero d.f. for SS(LOF) (see 
Table 12,1). ^ 


12.4 SECOND-ORDER MODELS 
AND DESIGNS 

12.4.1 Second-Order Linear Regression 

One advantage of using a factorial with r replications over a simplex design with r 
replications is that the factorial design provides an opportunity to check the adequacy 
of the model (12.4) through the F-test 


Flof = 


MS(LOF) 

MS(PE) 


(see Table 12.1). This test allows us to check whether interactions among the factors 
are present and if so the design enables us to obtain the various sums of squares for 
two-factor interactions, three-factor interactions, and so on (see Section 11.3). This, 
however, may provide only part of the answer, why (12.4) is not a good approximation 
to the true response surface d(Xi,X 2 ,... Another reason for an inadequate 

fit may be that curvature due to various factors is present. This, however, cannot be 
detected with a 2 fe experiment. We shall now consider an extension of model (12.4) 
incorporating some form of curvature and interaction and suitable designs to estimate 
the parameters of such models. 
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Table 12.2 ANOVA for Second-Order Response Surface Design 


Source d.f. 


Regression 

Linear effects 
Quadratic effects 
Linear x linear effects 
Error 

Lack of fit 
Pure 


2k + |fc(/c — 1) 

k 

k 

Ik(k-l) 

r3 k -2k- |fc(fc — 1) — 1 
3^ — 2k — ^h{h — 1) — 1 
3 fc (r — 1) 


A second-order model for k input variables is defined as 

k k 

y(Xi,X2, ..., Xk) = Po + 0iXi + 0u^i + 0ijXiXj 

i=l i=l i<j 

+ (12.17) 

12.4.2 Possible Designs 

An obvious but usually not the best design for estimating the parameters of this model 
is a 3 fc factorial. If we choose the levels for each factor to be equidistant we can 
reparameterize (12,17) in terms of orthogonal polynomials (see Section 11.9.1) and 
obtain estimates of the linear, quadratic and linear x linear effects for all factors and 
two-factor combinations, that is, of a pq with p^,q = 0.1. 2 and p + g < 2 (see model 
(11.49). For r replications of each design point in a CRD a sketch of the ANOVA is 
given in Table 12.2 (for other details see Table 11.16). Here again we can partition the 
d.f. for error into d.f. for lack of fit (accounting for interactions other than linear x 
linear) and d.f for pure error (arising from replications). Even for small k the number 
of d.f. for lack of fit is quite substantial, stemming from a large number of experimental 
points and the (assumed) absence of most interaction effects. There are several ways 
in which we can reduce the excessive number of experimental points: 

(i) We can eliminate replication of experimental points, that is, choose r ― 1. 

(ii) We can use fractional factorials with or without replication, but we have to re¬ 
strict ourselves now to resolution V designs so that main effects (linear and 
quadratic) and two-factor interactions can be estimated, for example a 1/3 frac¬ 
tion of a 3 5 or 3 6 , or a 1/9 fraction of a 3 7 (see Sections 11.9.5, 11.9.6, and 
Chapter 11.13). Even in these cases the number of experimental points is gener¬ 
ally excessive for the number of parameters (regression coefficients) to be esti¬ 
mated. 
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(iii) We may try to construct designs which are more suited for the specific situation 
with a limited number of design points. Several such classes of designs have 
been proposed. We shall consider briefly two — central composite designs and 
Box-Behnken designs. 

12.4.3 Central Composite Designs 


The central composite design (CCD) was introduced by Box and Wilson (1951). Each 
factor is used at five different levels, but not all level combinations occur. Rather, the 
CCD is composed of three parts: 

(a) a factorial or “cube” part consisting of 2 k ~ p points from a full 2 k factorial (p = 
0) or a 1/2 P fraction of the 2 k factorial of at least resolution V (see above), each 
point being replicated r/ times; the levels of each factor are coded as —1 and 
+1; the number of experimental runs is n/ = 2 k ~" p rf\ 

(b) an axial or “star” part consisting of 2k points on the axis of each factor at a 
distance a from the center of the design, each point being replicated r a times; 
this gives rise to n a = 2kr a experimental runs; 

(c) no replications of the center point (0, 0, … ， 0). 

The total number of experimental runs then is N = ri f + n a + n.o. 


Example 12.1: For k = 2, the basic CCD is as given in Figure 12.1. The design 
matrix for this design, D* say, with rf = l.r a = 1- no = 1 can be written as 



0 

0 



0 

0 


a 

~a 


0 / 


(12.18) 


□ 

The values for a.rj. r a , and no can be chosen to obtain certain properties of the de¬ 
sign and to satisfy economic requirements. One such property, that of rotatability, was 
introduced by Box and Hunter (1957). A design is said to be rotatable if the prediction 
variance, for a level combination z = {z\. Z 2 ...., ZkY in ER, that is, var[ 々 (z)]，is the 
same for all points that are equidistant from the design center. This property is satisfied 
for the first-order designs discussed in Section 12.3 (see (12.11) which depends only 
on E zf) and it is satisfied simply because the columns of D are orthogonal (and hence 
the design is orthogonal) and because of the scaling used. For second-order designs 
the conditions are more complex in general having to do with the so-called design mo- 
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X2 



Figure 12.1 CCD for k = 2. 


ments [see Box and Hunter (1957)，Box and Draper (1996), Khuri and Cornell (1996 )， 
Myers and Montgomery (2002)]. For the CCD the conditions are satisfied by choosing 


a = 



(12.19) 


For example, for k = 2.rf = r a , we have a = V2. 

12.4.4 Blocking in Central Composite Designs 

An important property for CCDs is that of orthogonal blocking as discussed by Box 
and Hunter (1957). This idea is similar to that of systems of confounding in factorial 
experiments (see Section 11.6) which also leads to orthogonal blocking, except that we 
have to deal here with the different components of the CCD. If we denote the levels of 
the ith factor for the N experimental runs by xu (Z = 1,2,, N )，each Xu being one 
of -a, —1,0,1, a, then Box and Hunter (1957) give the following two conditions for 
orthogonal blocking: (i) Each block must itself be a first-order orthogonal design, and 
(ii) the fraction of the total sum of squares of each input variable contributed by every 
block must equal the fraction of the total observations in the block. Suppose we have 
b blocks and the size of the uth block is n u , that is, I^ =1 n w = N. Then, according to 
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(i) we must have 

XiiXji =0 (i.j = 0, 1” . •, _ j) (12.20) 

K u ) 

for every u — 1,2,.... 6, where E z ( u ) means summation over all l in block u. Condi¬ 
tion (ii) can be written as 

E4 

^ — = 每 (i = 1.2,... . fc ) (12.21) 

i=i 

for every u = 1.2 ,..., 6. 

Different blocking schemes satisfying (12.20) and (12.21) can be derived from con¬ 
sidering first the case of 6 = 2 blocks. One block consists of all n/ runs from the 
factorial part plus no/ center runs; the other block consists of all n a runs from the axial 
part plus riQa center runs, where no/ + n^ a = no. It is obvious by just looking at D ¥ 
that condition (12.20) is satisfied for all pairs (i, j) and both blocks. Condition (12.21) 
for the first block is 


n f _ 

_ rif + nof 

( 12 . 22 ) 

rtf + 2r a a 2 

N 

and for the second block 



2 r a a 2 

几 a + 几 0 a 

(12.23) 

nj + 2 r a a 2 

—N ， 

which is the same for every z = 1,2. k. Combining (12.22) and (12.23) yields 


(12.24) 

2r a a J n a -f n 0a 

Thus (12.24) gives us the value of a such that (12.21) holds, that is, 

2 krif n a + noa 

a = - - - - 

n a rtf + n 0 f 

(12.25) 

l + nof/n f 

Typically no a /n a and riof/rtf are quite small, so that a = Vk which means that the 
axial points have about the same distance from the center as the factorial points. 

Orthogonal blocking with smaller blocks than those discussed above can be ob¬ 
tained, using similar arguments, in a number of ways. We mention just a few: 

(i) If 77 > 1， each block may consist of one or more replicates of all 2 k points plus 
some center runs. 

(ii) Systems of confounding as discussed in Section 11.6 and in Chapters II .8 and 9 
may be used together with some center runs. 
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(iii) If r a > 1， each set of axial points plus some center runs may form a block. 

(iv) One block may consist of half of the axial points, say all points with +a，plus 
some center runs, and the other blocks have all axial points with —a plus some 
center runs. 

Methods (i) and (ii) may be combined with methods (iii) and (iv). Similar methods can 
be used if a fractional factorial is used for the factorial part of the CCD, remembering 
that those fractions have to be at least of resolution V. 

Box and Draper (1987) give some recommendations for the choice of r/ and r a ， 
and Draper (1982) discusses criteria for deciding on no = no/ + no 0) the number of 
center runs. If more experimental runs are needed, for example to increase d.f. for 
pure error, it is often convenient to increase no without sacrificing other properties of 
the CCD, 


12.4.5 Box-Behnken Designs 

We have mentioned earlier that using a 3 k factorial for a second-order response surface 
usually results in too many experimental points. The CCDs discussed above correct 
this situation, but they use five levels for each factor. An interesting class of designs 
using only three levels of each factor and at the same time resulting in a “reasonable” 
number of experimental points was proposed by Box and Behnken (1960). These de¬ 
signs can be constructed by combining ideas from incomplete block designs (BIBD or 
PBIBD; see Section 9.8 and Chapters II. 1-5) and factorial experiments, specifically 2 k 
factorials. The method can be described as follows. 

Suppose we have t input variables and an incomplete block design 

with t treatments and b blocks of size k. This design is characterized by its incidence 
matrix N = (nu) with nu = 1 if treatment l occurs in block i and nu = 0, other¬ 
wise. We now identify the t treatments with the t input variables and consider N’. Each 
row of N’ contains k unity elements. Suppose in the first row they occur in columns 
… We then replace these k unity elements successively by the level combi¬ 
nations of the 2 k factorial where the k factors are the input variables … The 
t — k zeros in the first row are replaced by 2 fc x 1 vectors of zeros. This procedure is 
repeated for each row of N / resulting in b2 k experimental points to which we add no 
center runs (see Jo and Hinkelmann, 1993). 


Example 12.2: Consider the case t = 6. The matrix N ; of a PBIBD (design R42 in 
Clatworthy, 1973) with blocks of size /c = 3 is given by 


N / = 


/I 

0 

0 

1 

0 

\1 


10 10 0、 

110 10 

0 110 1 

0 0 110 

10 0 11 

0 10 0 1 / 
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Table 12.3 Box-Behnken Design For t = 6 

Xl 

X2 

怎 3 

X 4 

^5 

xe 

Xl 

X2 

$3 

X4 

Xo 

Xe 

一 1 

-1 

0 

一 1 

0 

0 

-1 

0 

0 

-1 

-1 

0 

1 

-1 

0 

一 1 

0 

0 

1 

0 

0 

-1 

-1 

0 

—1 

1 

0 

-1 

0 

0 

-1 

0 

0 

1 

—1 

0 

1 

1 

0 

-1 

0 

0 

1 

0 

0 

1 

-1 

0 

—1 

-1 

0 

1 

0 

0 

-1 

0 

0 

-1 

1 

0 

1 

-1 

0 

1 

0 

0 

1 

0 

0 

-1 

1 

0 

-1 

1 

0 

1 

0 

0 

-1 

0 

0 

1 

1 

0 

1 

1 

0 

1 

0 

0 

1 

0 

0 

1 

1 

0 

0 

-1 

一 1 

0 

-1 

0 

0 

-1 

0 

0 

-1 

-1 

0 

1 

-1 

0 

-1 

0 

0 

1 

0 

0 

-1 

—1 

0 

-1 

1 

0 

-1 

0 

0 

-1 

0 

0 

1 

-1 

0 

1 

1 

0 

-1 

0 

0 

1 

0 

0 

1 

-1 

0 

-1 

-1 

0 

1 

0 

0 

-1 

0 

0 

—1 

1 

0 

1 

-1 

0 

1 

0 

0 

1 

0 

0 

-1 

1 

0 

-1 

1 

0 

1 

0 

0 

-1 

0 

0 

1 

1 

0 

1 

1 

0 

1 

0 

0 

1 

0 

0 

1 

1 

0 

0 

-1 

-1 

0 

-1 

-1 

0 

-1 

0 

0 

-1 

0 

0 

1 

-1 

0 

—1 

1 

0 

~1 

0 

0 

-1 

0 

0 

-1 

1 

0 

-1 

-1 

0 

1 

0 

0 

一 1 

0 

0 

1 

1 

0 

-1 

1 

0 

1 

0 

0 

-1 

0 

0 

-1 

-1 

0 

1 

-1 

0 

-1 

0 

0 

1 

0 

0 

1 

-1 

0 

1 

1 

0 

-1 

0 

0 

1 

0 

0 

一 1 

1 

0 

1 

—1 

0 

1 

0 

0 

1 

0 

0 

1 

1 

0 

1 

1 

0 

1 

0 

0 

1 







0 

0 

0 

0 

0 

0 


The level combinations for the 2 3 factorial are 

Xl, 

xi 2 xi 3 

-1 

—1 -1 

1 

-1 -1 

-1 

1 -1 

1 

1 -1 

-1 

-1 1 

1 

—1 

-1 1 

1 1 

1 

1 1 . 

Substituting xi x . xi 2 , and xi 3 for the Vs in each row and adding one center run we 
obtain the Box-Behnken design of Table 12.3. □ 
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To obtain economical designs we need to choose incomplete block designs with k 
and b small so that N = 62 fe + no does not become too large. For larger k we may 
use a fractional factorial of resolution V instead of the complete factorial. For partic¬ 
ular choices of incomplete block designs (that is, resolvable designs, see Chapter II.2) 
orthogonal blocking is possible. This may also be achieved by using a system of con¬ 
founding such that no main effects and no two-factor interactions are confounded with 
blocks. Box and Behnken (1960) give a list of designs for some values of t together 
with possibilities for orthogonal blocking. 

12.4.6 Hard-to-Change versus Easy-to-Change Factors 

We have mentioned earlier that the response surface designs, for instance, the CCD of 
(12.18)，are embedded in some form of error control design, in most cases a CRD or 
RCBD. This means, of course, that the treatment combinations are randomly assigned 
to the experimental units. In industrial experimentation it is often the case that the 
runs are performed sequentially. Random assignment then means random order of 
application. This implies that the factors have to be reset for each run, whether there is 
a level change or not. In practice the resetting is often not done if a factor remains the 
same between two or more runs，either because of convenience or because the factor in 
question is hard to change. This has led to the notion of hard-to-change factors (HTC) 
and easy-to-change factors (ETC). 

Webb, Lucas and Borkowski (2004) report on an example with one HTC factor and 
two ETC factors: 

Example 12.3: An experiment was performed to investigate three factors in the 
operation of a wrapper machine: spacing of the seal crimper, speed of the machine, 
temperature of the seal crimper. Spacing was recognized as a HTC factor, whereas 
speed and temperature were thought to be ETC factors. The experiment, using a Box- 
Behnken design (see Section 12.4.5), was set up in “blocks” of levels of the HTC 
factor, that is, in each “block” the HTC factor, was at the same level and no resetting 
took place. Within the “blocks” the ETC factor levels were randomized according to 
the chosen design, except when the experiment was actually performed it was found 
that speed also turned out to be a HTC factor. As a consequence, this factor was not 
reset when the same level occurred in consecutive runs. As a result, the experiment 
was conducted as illustrated in Table 12.4 ， where the lines indicate the “blocks” of not 
reset levels for spacing and speed. □ 

We shall not pursue this example here further, referring the reader to Webb, Lucas 
and Borkowski (2004)，except to say that this is an example of a split-split-plot type 
experiment (see Section 13.6) which is highly unbalanced and generally undesirable. 
Because of the split-split-plotting two additional errors will be induced and, as a con¬ 
sequence, an analysis using generalized least squares (GLS) (see Section 4.16.2) needs 
to be performed instead of ordinary least squares (OLS) with model (12.17). 

To avoid some of the complications associated with such an unbalanced design it 
is advisable to construct designs that show a certain amount of balance. In addition it 
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Table 12.4 Wrapper Machine Example 


Spacing 

Speed 

Temp 

0 

1 

一 1 

0 

1 

1 

0 

0 

0 

1 

0 

1 

1 

0 

-1 

1 

-1 

0 

1 

1 

0 

-1 

1 

0 

-1 

- 1 

0 

-1 

0 

-1 

-1 

0 

1 

0 

0 

0 

0 

-1 

-1 

0 

-1 

1 

0 

1 

1 


would be desirable to be able to estimate the parameters in the model 

y = X/3 + Zq + e_e ， (12.26) 


such that the estimates are identical under GLS and OLS. In (12.26) X and Z are known 
matrices, where Z is determined by “blocking” of the HTC factors. In the terminology 
of split-plot designs (see Chapter 13) the HTC factors are called whole-plot factors and 
the ETC factors are the split-plot factors. Thus, and are referred to as whole-plot 
and split-plot errors, respectively. 

Parker, Kowalski and Vining (2007) refer to designs that satisfy the properties men¬ 
tioned above as equivalent estimation designs. They provide techniques for construct¬ 
ing such designs, of which the following is an example. 

Example 12.4: Suppose we want to use a CCD for two factors A and B say, as 
given in (12.18). We identify A as a HTC factor and B as a ETC factor. An equivalent 
estimation CCD consists then of r(> 2) replicates of the design given in Table 12.5. 
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For this design we have for Z in (12.26) 



If there are more than one HTC factors, then each of these factors individually is 
kept at the same level for each whole plot. For example, if we have two HTC factors, 
A\ and two whole plots of the design may look as follows: 


Whole plot 


2 


It is important to remember that has to be reset in whole plot 2 even thought the 
levels are the same in whole plots 1 and 2. 

12.5 INTEGRATED MEAN SQUARED 
ERROR DESIGNS 

In our discussion of first-order and second-order designs we have made the assumption 
that the first-order and second-order models, respectively, are satisfactory approxima¬ 
tions to the true response surfaces. If this is true then the designs discussed are ap¬ 
propriate. Often, however, there is the fear that the assumption may not be right. We 
refer to such a situation as model misspecification. We may suspect, for example, that 
instead of a first-order model a second-order model may provide a better approxima¬ 
tion to the true situation, but we are not sure. An obvious reaction would be to use a 
second-order design so that we can estimate all second-order effects. The drawback of 
this approach is that if our suspicion is not true then we have wasted valuable resources 
by using too many experimental points. One must, therefore, find some compromise 
for the choice of an appropriate design; firstly, it must enable estimation of the parame¬ 
ters of the specified model sufficiently well; secondly, it must provide some protection 
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Table 12.5 Basic Replicate of Equivalent Estimation CCD 


A 

B 

-1 

一 1 

-1 

1 

1 

-1 

1 

1 

a 

0 

a 

0 

一 a 

0 

—a 

0 

0 

a 

0 

—a 

0 

0 

0 

0 


against model misspecification; and thirdly, it must be economical. We shall explain 
this for a very simple situation and then make some more general comments. 


12.5.1 Variance and Bias for the One-Factor Case 

Consider the case of one factor X and suppose we approximate cp(X. 6) by 

f (X., 0)=00+Pi X 


Suppose further that ER is defined by X L < X < Xu, or in the coded variable x by 
—1 < x < 1. For a given set of N : r-values ， 尤 1 , 工 2 , x n with x = 0, we then fit 
the model 

y = Pq + 8lx + e (12.27) 

and obtain the predicted (estimated) response curve 

y{z) = 0^,8lz (12.28) 

for any z in [—1,1]. Denote the true value of the response curve at z by 4>{z). Then the 
mean squared error associated with estimating (p(z) by y(z) is given by 

E [{){-) - <?W] 2 = E{y(z) - E[y(z)} + E[y{z)\ - <p{z)} 2 
= var[y(^)] + {E[y(z)\ - <p{z)} 2 . 


(12.29) 
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For purposes of comparison it is useful to normalize (12.29) for the number of experi¬ 
mental points, N ， and the error variance, as 

NE[y{z) - d)(z)} 2 TVvar [ 沉 ; r)] + N{E[y{z)} - 0{z)} 2 (12 3Q) 

<j e 2 a 2 e aj . 

which we write for short as 


M{z) = V(z) + B{z), 


(12.31) 


where V(z) is the variance at z and B(z) is the squared bias at z. For the special case 
here we find [see (12.11)] 


v{z) = 1 + k' 


(12.32) 


where g 2 x = (l/N)Eixf. is also referred to as the second design moment and 
denoted by [1 1] (analogously, (l/N)H x\ is referred to as the first design moment, 

denoted by [1], which equals zero in our case, and [1 1 1] = (l/N)Exf is the third 

design moment). The bias portion of M(z) depends, of course, on (p(z). Suppose that 


E[y(x)] = d(x) = ^ + (3lx + /3^ 2 . (12.33) 

In order to evaluate E[y(z)] we shall write (12.33) in matrix notation as 


where 


Now 


E(y) = ( x n ) ⑵， 



■1 a：r 




1 X2 


x\ 

Xi = 


， x 2 = 



.1 


A 、 


7l = ( 急)， 72 二泻. 


y( 2 ) = (1,21)7! 

= (l,z)(X' 1 X 1 r 1 X' 1 y 
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and 


^[y( 2 )] = (l,z)(X / 1 X 1 )- 1 X ； £(y) 

= (Lz)(X ； X 1 )- 1 X ； (X 1 ； X 2 )^^ 

=+ X 272 ) 

==/3 0 * + + (1. 2)(X , 1 X 1 )- 1 X / 1 X 272 (12.34) 

=/3p + ,^^ + I ^ I 02 - (12.35) 

In (12.34)，the matrix (X / 1 Xi) _1 X / 1 X 2 is called the alias matrix. Using the moment 
notation, B(z) can now be written as 

= l] 2 + [1 1 l]z)-z 2 ^ . (12.36) 

Rather than consider the mean squared error just for an arbitrary point ， 2 , it is more in¬ 
formative to consider some sort of average mean squared error, referred to as integrated 
mean squared error (IMSE) and defined as 


f ER M(z)dz 

f ER dz • 

Performing this operation for V(z) and B{z), we obtain for the IMSE 


(12.37) 


, f IerI V ( z ) + B ( z )] dz 

^~ 

= V-\-B. say. (12.38) 

For our example we have ER = [—1 ， 1] and hence f ER dz = dz = z. Substituting 
(12.32) and (12.36) in (12.38)，we obtain 


M 


3[1 




[1 1] 2 


[1 1 l ] 2 2[1 1 ] 

3[1 l] 2 3 


+ r • (12.39) 


One would like to choose a design that minimizes M and from (12.39) it can be seen 
that this has to be done by affecting the design moments [1 1] and [1 1 1]. Unfor¬ 

tunately, such a choice will also be influenced by the unknown parameter /3 《 /<j e ，the 
standardized measure of the true curvature. Furthermore, the choice of the design will 
depend on the relative magnitude of V and B and how important they appear, relative 
to each other, to the investigator. 
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12.5.2 Choice of Design 


There are two extreme cases: (i) V is much larger than B and (ii) B is much larger 
than V. For all practical purposes we can think of (i) as V > 0 : B = 0. To minimize 
M then means to minimize V and that, in turn, means to maximize [1 1]. For iV 

even, this is achieved by choosing N/2 experimental runs at x = —1 and x = +1 ， 
respectively. For N odd, we choose (N — 1)/2 runs at x = — 1 and x = +1 each and 
one run at or = 0, with [1 1] = (iV — 1) /N. Such designs (which assume B — 0) are 

referred to as all-variance designs. 

Case (ii) above can be characterized essentially by = 0. ^ > 0. To minimize M 
then is to minimize B. The first step might be to choose a design with [1 1 1] = 0. 

It follows then from (12.39) that now 



and hence minimizing B requires a design with [1 1] = Such a design (which 

assumes F = 0) is referred to as an all-bias design. Comparing the values for [1 1] 

for the all-variance and all-bias designs shows that the spread of the experimental points 
for the all-bias design is much smaller than that for the all-variance designs. In fact, 
the conditions for [1 1] for these two types of design are in conflict with each other 

and hence minimizing M cannot be achieved by minimizing V and B separately. 

For the general problem of minimizing M one may start again by choosing [111] = 
0 and then minimize the resulting expression for M with respect to [1 1] for different 

values of iV/^ 2 /cr!. Alternatively, one can minimize M with respect to [1 1] as a 

function of iV/?| 2 /^e and then choose the value of [1 1] which forces V/B to be a 

certain value a which expresses the experimenter’s opinion about the relative values 
of V and B. For example, a = 1 implies that V and B are equally important. In that 
case the optimal value of [1 1] is .388 (for \fN02 2 /(J e = 4.49), comparable to that 

for the all-bias design (Box and Draper, 1959). Computations for other cases show that 
when curvature is suspected the optimal design is closer to the all-bias design than to 
the all-variance design (Box and Draper, 1959; Khuri and Cornell, 1996). 

It is apparent from this simple example that choosing a design which minimizes 
IMSE is rather complex and becomes even more so if we consider a first-order model 
or a second-order model and want to protect against second-order effects or third-order 
effects, respectively. General considerations for a specific class of designs indicate, 
however, that optimal designs tend to be close to all-bias designs. More specific results 
are provided by Box and Draper (1963) (see also Box and Draper, 1987, and Khuri 
and Cornell, 1996). It must be emphasized, however, that many of these results are 
somewhat subjective in that they are not always invariant to the scaling of the input 
variables. Hence those results may be taken as general guidelines only when choosing 
an appropriate design. 
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12.6 SEARCHING FOR AN OPTIMUM 

As we have pointed out earlier, RSM is a sequential process based on subject matter 
and statistical input. Experiments are performed using the investigator’s best knowl¬ 
edge about the process under study and the statistician’s recommendations how to best 
perform the experiment (see also Chapter 2). After having decided which factors (input 
variables) should be studied and in which range of levels, usually a first-order design 
is used to approximate the response surface in the chosen ER, In the search for an op¬ 
timum response and the levels of the factors at that optimum response one can imagine 
many scenarios leading to a sequence of experiments and statistical decisions. It is, 
of course, impossible to describe every situation that might possibly arise, instead we 
shall mention briefly some of the steps in the sequence of events. 

Following each experiment it is important to study the estimated response surface 
in some detail. This requires that the underlying design has been chosen with those 
goals in mind. We may, for example, want to 

(i) investigate which factors are important, 

(ii) examine whether the chosen polynomial provides an adequate approximation to 
the response surface, 

(iii) plot the contours of the response surface, 

(iy) decide on a new ER, 

(v) locate the optimum response as quickly as possible. 

Some of these goals are relatively easy to obtain by using designs of the type we have 
discussed in Sections 12.3 and 12.4. We can test hypotheses about the regression coeffi¬ 
cients in the model to check (i). Assuming that the design chosen allows the estimation 
of pure error (constituting experimental and observational error) or if such information 
is available from other sources, we can examine (ii) through a lack-of-fit test as ex¬ 
emplified in the AN OVA s given in Tables 12.1 and 12.2. But even drawing a contour 
map, that is, a map of equal responses, ^(x), for different input variables x is not al¬ 
ways easy. Even if we could draw in a fc-dimensional space the shape of the contours 
depend crucially on the scaling used for the input variables. 

The reason why we mention this is the fact that, loosely speaking, the contour map 
is used to locate new ERs in the pursuit of locating the optimum response. Different 
mathematical techniques have been proposed and are used to find the most direct path 
to the optimum. They all depend on the contour map as established from the results 
of the initial experiment and updated by subsequent experiments as determined by the 
optimization procedure used. 

The procedure most often discussed is the method of steepest ascent which was 
introduced in RSM by Box and Wilson (1951). (For a detailed discussion see also Box 
and Draper, 1987, and Khuri and Cornell, 1996). A direction perpendicular to the con¬ 
tour planes as established by a first-order model or contour surfaces for a second-order 
model is the direction of steepest ascent, pointing towards higher responses. Along 
this path further experiments are performed until a best value or apparent maximum 
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is reached. Such a point may serve as the center point for a new ER in which then a 
more comprehensive experiment will be performed, continuing this cycle as long as 
necessary, changing most likely from first-order designs to second-order designs as the 
situation warrants. The virtues and value of the method of steepest ascent have been 
put into question when Johnson in the discussion of the Box and Wilson (1951) paper 
pointed out that the method suffers from dependence on the scale (that is, choice of 
units) of the input variables. As a consequence, a certain amount of care must be used 
when applying it. Subjective scaling will lead to subjective directions of experimen¬ 
tation and only through checks can a potentially misleading direction be avoided. A 
perhaps more useful method would be to scale the input variables such that the change 
of one unit for one variable is as important as the change of one unit in another variable. 
But even that is not entirely objective and may depend on the location in the OR. 

To avoid the problem of scale-dependence, other optimization procedures have 
been proposed. The method of parallel tangents (PARTAN) was introduced by Shah, 
Buehler, and Kempthorne (1964) and further discussed by Buehler, Shah, and Kempthome 
(1964). Another approach, using simplex designs, was proposed by Spendley, Hext, 
and Hinsworth (1962). Any discussion of these methods is beyond the scope of this 
chapter and the reader is referred to the pertinent literature. 

12.7 EXPERIMENTS WITH MIXTURES 

12.7.1 Defining the Problem 

A special and yet quite distinct application of response surface methodology occurs 
in experiments with mixtures. The special feature of these experiments is that the re¬ 
sponse (12.1) does not depend on the actual values (amounts) of the input (represented 
by the input variables Xi, 乂 2 , • •. ， ^k) but rather on the proportions relative to each 
other, that is, for a mixture of three ingredients we might have Xi = 50%, X 2 = 
25%, X 3 = 25% with, of course, Xi-\- X 2 = 100%. An example of such a mix¬ 
ture experiment may be the blending of three gasoline stocks to determine the blend 
which will give the best mileage. 

The pioneering work in this area was done by Scheffe (1958) who introduced 
simplex-lattice designs and appropriate polynomial models to investigate the type of 
question mentioned above. An excellent account of current methodology and thinking 
is given by Cornell, (2002). We shall give here only a very brief discussion of some 
of the design aspects in this area, to what extent they are different from designs for 
comparative experiments and to what extent they make use of the designs we have dis¬ 
cussed in other chapters. For details the reader should refer to Cornell, (2002) and the 
references therein. 

Let Xi ， X 2 ,.... X/j be the input variables which are constrained by the condition 

= 1 (12.40) 

This condition introduces a dependence among the Xi's which means that they cannot 
take on all values in but only in the ^-dimensional simplex. The coordinate system 
used for these values is called the simplex coordinate system. For fc = 3, for example, 
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(1，0,0) 

xi = 1 



Figure 12.2 Triangular Coordinate Paper. 


this coordinate system can be plotted on triangular graph paper with lines parallel to 
the sides of the equilateral triangle as given in Figure 12.2 

In order to fit an approximate model of the form (12.3)，usually a linear or quadratic 
model, we need to conduct an experiment in order to obtain appropriate observations, 
y(X). Three types of designs are used most often: simplex-lattice designs, simplex- 
centroid designs, and axial designs. 


12.7.2 Simplex-Lattice Designs 

The name simplex-lattice design refers to a collection of uniformly spaced points on 
a simplex. For a fc-dimensional simplex there exist different simplex-lattice designs 
depending on the spacing of the levels, that is，the proportions for each component 
Xi {i — 1 . 2 . .. , ,k) may take the 爪 + 1 equally spaced values 

A = 0 .丄 .1 

subject to (12.40). Such a lattice is referred to as a (/c, m) lattice. For example, the (3, 
2) lattice consists of the points 


{ X1 . X2 - ^ 3 ) = {( 1 ， 0 , 0 )，( 0 , 1 , 0 ), ( 0 , 0 , 1 )，（ U ， 0 ), (辜， 0 , 士 )，( 0 , !，！)} 
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and these points lie all on the vertices and sides of the simplex, that is, triangle (see 
Figure 12.2). For the (3, 3) lattice the points are 

(Xi,X2, X^) = {(1.0.0), (0,1,0), (0.0,1), (|, I ， 0), (|, 0, |). (|, I ， 0), 

(~,0, |), (0. I，|). (0, I，|). (^. I，|)}. 

Here we have in addition to points on the boundary of the simplex also a design point 
at the centroid of the simplex. 

12.7.3 Simplex-Centroid Designs 

For a fc-component simplex-centroid design, the design points are such that either one, 
or two, or three, ..or components are included in the mixture and if / (1 彡/彡 
k) components are included in the mixture they are included in equal proportions, 
that is, l/l. Thus the simplex-centroid design consists of 2 fe - 1 points: k permu- 

tations of (1,0,. •., 0); (g) permutations of 0,..., 0),..., (f) permutations 

of (H … ，十， 0 , … ， 0 )， …， and the centroid (H • • • ，臺 )• The points are located 
at the centroid of the (k — l)-dimensional lattice and at the centroids of all lower¬ 
dimensional simplexes contained in the (k - l)-dimensional simplex. 

12.7.4 Axial Designs 

Whereas for the simplex-lattice design and the simplex-centroid design, the design 
points (with the exception of the overall centroid) are located on the boundaries of 
the simplex, the design points for the axial design are located on the component axes 
(see Figure 12.2 for k — 3). This implies that for every such point all k components 
are included in the mixture. A simple form of such a design was suggested by Cornell 
(1975). In it the points are located at equal distances, say Ai. A 2 ,..from the centroid 
(H … ， I) toward each of the vertices. 

12.7.5 Canonical Polynomials 

For all three types of designs, and combinations of them, the number of points (runs) 
depends to some extent on the degree of the polynomial to be fitted to the data. Typi¬ 
cally, the polynomials are of the first or second degree, that is, 

k 

y(X)=0 o + J2^ x i + e (12.41) 

i=l 

or 

k k k 

y(X) = do + ^ 3iXi + ^2 ^2 PijXiXj + e. (12.42) 

i=l i=l i<j 

The number of points, obviously, has to be at least as large as the number of parameters 
to be estimated in (12.41) or (12.42) or any other model that might be appropriate. We 
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could proceed then as usual, except that for the situation described in this section we 
have to take condition (12.40) into account and because of it the parameters associated 
with the various X-terms are not unique. To remove the dependence among the X- 
values we could write, for example, 


k-l 

X k = l-^2Xi 

i=l 


(12.43) 


and substitute (12.43) in (12.41) and (12.42). In (12.41) this would lead to a model 
with parameters /?i - (i = 1,2,..., A: - 1) obscuring the separate effects of the 
individual components. The effects on model (12.42) are even more complex. 

The preferred way of removing the effect of the dependence among the Xi is to ob¬ 
tain the so-called canonical polynomial For the linear polynomial this form is obtained 
by multiplying /?o in (12.41) by = 1) and then simplifying, that is, 

/ k \ k k 

y(X) = d 0 ij2 X ^) + T,^ Xl+e = J2 ^ Xi + e (12-44) 

\i=l / i=l i=l 

with 0* = fh + 队 [i = 1,2...., k). Model (12.44) retains the symmetry in the k 
components and the ， / 3* have a clear meaning. 

To achieve the canonical form for model (12.42) we proceed in the same way as 
above and use in addition 


x, 2 = 不 



k 


j=l 
i 私 


Collecting terms leads to the canonical model 


k k 

y(X) = E 13;Xi + Y, J2 KjXiXj+e (12.45) 

i=l i<j 

with P* = po + A + 0u and /3*j = Pij - /3u - = 1,2 5 ...,&;z < j). Model 

(12.45) can be simplified still further by multiplying ^ P*Xi by which yields 

k 

2/(X) = ^ y^SjjXjXj + e (12.46) 

i<j 


with da = (3* and Sij = /3*j + /3* +/3J {i.j = 1,2,..., fc; 2 < j). Models (12.45) and 
(12.46) are, of course, equivalent and contain the same number of parameters. 

With data obtained from an appropriate design, models (12.44) or (12.45) or (12.46) 
can be fitted using the method of least squares. Appropriate tests of hypotheses includ¬ 
ing lack of fit can then be performed in the usual fashion. From the estimated regression 
coefficients the response surface can be predicted and standard errors can be obtained 
using familiar procedures (for details see Cornell, 2002). 
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12.7.6 Including Process Variables 

So far we have discussed the situation where the response 77 of (12.1) depends only on 
the mixture variables Xi. In many practical situations there may, however, be other 
variables, not connected with the blending process itself, which influence rj. Such 
variables are referred to as process variables. In the example of blending gasoline 
stocks such process variables may, for example, be type of car (light, heavy) and speed 
of driving (slow, fast). If we denote the process input variables by Zi, Z 2 ,.. •, then 
model ( 12 . 1 ) may be generalized to 

w z) = <P{X U X 2 , … ， X k . ， z u z 2 ”.,,z p 'n. ， e s ) (12.47) 

and (12.47) will be approximated by a (low degree) polynomial in X, Z, and XZ. 
This will be used to assess not only the effects of the blending variables (X), but 
also the additive effects of the various “levels” of the process variables (Z) and, quite 
importantly, the possible interactions (XZ) between the blending and process variables. 

The added problem then is to augment the design for the blending variables (as 
discussed above) with a design for the process variables. The latter will typically be a 
factorial design. In its simplest form this may be a 2 P factorial or, in order to keep the 
number of runs at a reasonable level, it may be a fractional factorial (see Sections 11.6 
and 11.7). The total number of runs is then determined by the number of design points 
for the mixture experiment and the number of treatment combinations used for the 
process variables in that each mixture experiment is performed at each process variable 
combination included in the factorial or fractional factorial design. To make the entire 
experiment more manageable the device of blocking may have to be used. Also, it 
may be possible to reduce the number of runs by combining the design points for 
the two component designs (that is, blending and process) in a way different from 
that described above. This provides an example how mixture designs and designs for 
comparative experiments can be combined in a useful way, and how elements from 
response surface methodology and design of comparative experiments can be brought 
to bear on problems that arise in several types of industries, such as chemical, food, 
textile industries and others. 


12.8 EXAMPLES USING SAS® 

Since analyzing data from response surface experiments involves regression models 
the most appropriate SAS procedure to use for the analysis are PROC REG or PROC 
RSREG. However, for certain purposes and in certain situations, also PROC GLM and 
PROC MIXED prove to be useful. We shall illustrate the use of these procedures in the 
following examples. 

Example 12.5: Consider a first-order design for fc = 3 variables with the treatment 
combinations for the 2 3 factorial as the design points, each replicated twice in a CRD. 
The design and the observations are given in Table 12.6a. 

For the analysis we use PROC REG to estimate and test the regression coefficients. 
The results are given in Table 12.6b, with all three regression coefficients significant at 
P < .0001. 
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In order to obtain explicitly SS(LOF) we use PROC GLM with X 2 , xs as 
classification variables. In addition to specifying in the model statement X 2 , we 
also include the 3-factor interaction term x\ *^2 *^ 3 . This is a device for collecting all 
interaction terms, three 2-factor and one 3-factor interaction, into one sum of squares, 
which constitutes SS(LOF) with 4 d.f. of Table 12.1. The P-value 0.4233 indicates 
that there is no lack of fit. 

Finally, we point out that the test statistics for testing significance of the regression 
coefficients are not identical since in the first analysis [SS(LOF) 4 - SS(P_E)]/12 = 
0.1725 is used as the error term, whereas in the second analysis SS(PE)/S = 0.1675 
is used. 


EXAMPLE 12.6: Consider the CCD in two variables as given in Table 12.7a. To 
analyze the data we use PROC RSREG. The output is given in Table 12.7b. We make 
the following comments: 

(i) Simply inputting the two variables, A and B, in the model statement leads to a 
second-order model and analysis. 

(ii) Including the option “lackfit” in the model statement leads to a partitioning 

SS(E) = SS(LOF) + SS{PE) 

The results indicate that there is no lack of fit (P = .42). 

(iii) The first-order regression coefficients are significant (P = .05, and .04, respec¬ 
tively), whereas the second-order coefficients are not significant with P = .15, 
.14, and .11, respectively. 

Example 12.7: Consider the CCD as given in Table 12.8a and two situations under 
which this experiment could have been performed: (a) as a CRD in which case the 
block classification is ignored, or (b) as a split-plot type design, where factor A is the 
hard-to-change factor (see Section 12.4.6) and we have “blocks” of size 2. 

We comment on both analyses as given in Table 12.8b: 

CRD: 

(i) The number of d.f. for error equals 6 with 3 d.f. due to LOF and 3 d.f. due to 
pure error, and MS(E) = 1.05. 

(ii) The linear regression coefficients are significant with P = .016 and .013, respec¬ 
tively, whereas the quadratic and mixed regression coefficients are marginally 
significant with P — .061 , .064, .087, respectively. 
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Table 12.6 First-order Design and Analysis 


a) Input statements: 

data first; 

i 叩 ut x 1 x2 x3 y @ @; 
datalines; 


-1 -I -1 10.1 
1 - 1-1 12.0 
- ： : - ： 13.2 
11-1 14.5 
-1 1 13.4 

1 '： 1 15.3 

-1:1 16.6 
111 18.2 

/ 

run; 


-1-1 11.3 
-1-1 11.7 
1-1 12.9 
1-1 14.7 
-1 1 13.9 

-1 1 14.9 

1 116.0 
1 I 18.7 


proc reg data=first; 

model y = xl x2 x3; 

title 1 ’FIRST-ORDER DESIGN ’； 

title2 ’REGRESSION ANALYSIS ’； 

run; 


proc glm data=first; 
class xl x2 x3; 

model y = xl x2 x3 xl *x2 氺 x3/ss3; 
title2 ’TESTING FOR LACK OF FIT ’； 
run; 

b) Output: 


FIRST-ORDER DESIGN 
REGRESSION ANALYSIS 

The REG Procedure 
Model : MODEL1 
Dependent Variable : y 

Number of Observations Read 16 




Number of 

Observations Us 

ed 16 







Analysis cf Variance 







Sum of 

Mean 



Source 



DF 

Squares 

Square 

F Value 

Pr > F 

Model 



3 

84.94750 

28.31583 

164.15 

< .C001 

Error 



12 

2 _ 07000 

0.17250 



Corrected 

Total 


15 

87.0175C 





Root 

MSS 


0.41533 

R-Square 

0.9762 



Dependent 

Mean 

14 ■ 2125C 

Adj R-Sq 

0.9703 



Coef f 

Var 


2.9223C 
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<.00G1 

<.00C1 

<.0001 

<.0001 


Intercept 

xl 

x2 

x3 


14.21250 

0.78750 

1.38750 

1.6625C 


0.1C383 

0.10383 

0.10383 

0.10383 



Table 12.6 {Continued) 

Parameter Estimates 
Parameter Standard 

Variable DF Estimate Error t Value Pr > it I 


8 8 6 1 
8 5 3 0 

6 7 3 6 
3 11 



Table 12.7 Regression Analysis for CCD 


0.0347 
0.1910 
0.1411 
0.0677 


Linear 2 
Quadratic 2 
Crossproduct 1 
Total Model 5 


22.990354 0.5829 

6.778105 0.1719 

4.410000 0.1118 

34.178459 0.8666 



Sum of 

Residual DF Squares Mean Square F Value Pr > F 

Lack of Fit 3 4.656541 1.552180 2.57 0.4233 

Pure Error 1 0.605000 0.605000 

Total Error 4 5.261541 1.315385 


4 8 5 0 

7 5 3 2 

8 2 3 5 
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Table 12.7 {Continued) 


Faramerer 

DF 

Estimate 

Standard 

Error 

- Value Pr 

> !t ! 

Parameter 
Estimate 
from Coded 
Data 

Intercept 

1 

3.561442 

0.810817 

4.3S 0 

.0113 

3.561442 

A 

1 

1.143S39 

0.407534 

2.81 0 

.0485 

1.601515 

B 

i 

1.262626 

0.407534 

3.10 0 

.0363 

1.767677 

A*A 

l 

0.970668 

0.543320 

1.7 9 0 

.1485 

1.902509 

3*A 

l 

1.C5C00C 

0.573451 

1.83 0 

• 1411 

2.058000 

BxB 

l 

1.CS8219 

0.543320 

2.C2 0 

.1133 

2.152509 

Fac 

:or 

DF 

Sum cf 
Squares 

^ean Square 

F Value 

Pr > F 

A 


3 18 

.972482 

6.324161 

4.81 

0.0817 

3 


3 22 

.410532 

7.470177 

5.68 

0.0633 


Predicted value at stationary point : 3.C97108 
Stationary point is a minimum. 


Split-plot: 

(iii) The estimates of the regression parameters are the same as for the CRD, but the 

standard errors are different: they are larger for A, and B 氺 B and smaller 

for B and A 氺 B (see (iv) below). 

(iv) Since factor A is the whole-plot factor, the regression coefficients A and A* A are 
evaluated against the whole-spot error MS, which is equal to .8472 + 2 x .3065 
as obtained from the covariance parameter estimates. Factor B is the split-plot 
factor and hence the regression coefficient B is evaluated against the split-plot 
error, which is .8472. However, the regression coefficient B * B is confounded 
with the whole-plot (because the squared values of the levels for factor B do 
not change within a whole-plot), hence the larger standard error. The regression 
coefficient A^B is associated with the split-plot and hence has a smaller standard 
error. 


(v) The comments about the various standard errors in (iv) are also reflected in the 
different d.f. associated with the tests about the regression coefficients. We have 
2 d.f. for the whole-plot error and 4 d.f. for the split-plot error. 

(vi) Only the regression coefficient B is clearly significant (P = .02), whereas A 
and A * B are marginally significant (P = .11) and P = .08, respectively). □ 





0.0160 
0. C.134 
0.2486 
0.0648 
0.0865 


A 

1 

11 • 60121622 

11.60121622 

3 

1 

12.62626263 

12.62626263 

A*A 

1 

1.71667962 

1.71667962 

B*B 

A*B 

1 

5.35325423 

5.35925423 

1 

4 . 41C00C00 

4.41000000 


Table 12.8 Central Composite Design 

a) Input statements: 

data ccd; 

input A B block y; 


run; 

proc glm data=ccd; 

model y= A B A*A B*B A*B/solution; 

title 1 ’CENTRAL COMPOSITE DESIGN，； 

title2，AS COMPLETELY RADOMIZED DESIGN，； 

run; 

proc mixed data=ccd; 
class block; 

model y=A B A*A B*B A*B/solution ddfm=Satterth; 
title2，AS SPLIT-PLOT DESIGN，； 
random block; 

run; 

b) Output: 


CENTRAL COMPOSITE DESIGN 
AS COMPLETELY RADOMIZED DESIGN 

The GLM Procedure 

Number of Observations Read 12 

Number of Observations Used 12 

Dependent Variable : y 

Sum of 

Source DF Squares Mean Square F Value Pr > F 

Model 5 35.7134I26S 7.14268254 6.79 0.0186 

Error 6 6.30908731 1.05151455 

Corrected Total 11 42.C-225000C 

R-Square Coeff Var Root MSE y Mean 

0.849864 19.4395C 1.C25434 5.275000 


Source DF Type I SS Mean Square F Value Pr > F 


3 13 0 9 
0 0 6 11 



. 4.4 
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Table 12.8 (Continued) 


0.0027 

0.0160 

0.0134 

0.0614 

0.0648 

0.0865 


Intercept 

A 

B 

A*A 

B*B 


3.562529219 

0.989864865 

1.262626263 

1.010640582 

1.083796071 

1.050000000 


0.72492171 

0.29801066 

0.36437205 

0.44003131 

0.48006831 

0.51271692 


CENTRAL COMPOSITE DESIGN 3 

AS SPLIT-PLOT DESIGN 

The Mixed Procedure 

Model Information 

Data Set WORK.CCD 

Dependent Variable y 

Covariance Structure Variance Components 

Estimation Method REML 

Residual Variance Method Profile 

Fixed Effects SE Method Model-Based 

Degrees of Freedom Method Satterthwaite 


Class Level Information 
Class levels Values 

block 6 123456 


Iteration History 

Iteration Evaluations -2 Res Log Like Criterion 

0 1 29.29778783 

1 1 29.09016318 0.00000000 

Convergence criteria met. 

Cov Parrn Estimate 

block 0.3065 

Residual 0.8472 


Solution for Fixed Effects 


Effect 

Estimate 

Standard 

Error 

DF 

t Value 

Pr > 11 | 

Intercept 

3.5625 

0.8543 

2 

4.17 

0.0530 

A 

0.9899 

0.3512 

2 

2.82 

0.1062 

B 

1.2626 

0.3271 

4 

3.86 

0.0181 

A*A 

1.0106 

0.5185 

2 

1.95 

0.1906 

B-E 

1.0838 

0.5657 

2 

1.92 

0.1955 

A*B 

1.0500 

0.4602 

4 

2.28 

0.0846 



12 7 0 6 5 
9 3 4 3 2 0 

4 3 3 2 2 2 
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12.9 EXERCISES 

12.1 Consider an experiment with 5 input variables. 

(i) Obtain an appropriate 1/2 fraction of the 2° factorial to fit a first-order 
model. 

(ii) For the design chosen in (i) sketch the ANOVA table assuming that each 
design point is replicated twice. 

(iii) Suppose we need to run the experiment in blocks of size 8. Write out an 
appropriate plan, give the associated linear model and outline the ANOVA 
table. 

12.2 Consider a simplex design with fc = 4 input variables. 

(i) Write out explicitly the design-model matrix D. 

(ii) Outline the ANOVA table with r = 2 replications for each design point. 

12.3 Consider a central composite design for an experiment with five input variables. 
Show that with a 1/2 fraction of resolution V of the 2° as the factorial part of the 
design all linear, quadratic and linear x linear effects [see model (12.15)] can be 
estimated. 

12.4 For an experiment with four input variables, construct a Box-Behnken design 
using the following BIBD with four treatments and six blocks of size 2: 


Blocks 


Treatment 1 2 3 4 5 6 
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CHAPTER 13 

Split-Plot Type Designs 


13.1 INTRODUCTION 


In all the error-control designs discussed so far we have had one type of EU for all the 
treatments and one randomization process to assign the treatments to the EUs. There 
exist, however, many situations where for a factorial experiment different types of EUs 
are being used and where the levels of some factors are applied sequentially, necessi¬ 
tating separate randomization procedures. We have already pointed to such situations 
in Sections 12.4.6 and 2.3.2. In the simplest case we have EUs of one size for the levels 
of one of two factors. Those EUs are then subdivided into smaller EUs to which the 
levels of the second factor are applied. This procedure is referred to as the split-unit 
principle. The following is an example of such a situation. 


EXAMPLE 13.1: Suppose we want to investigate the breaking strength of dinnerware 
manufactured by using different chemical compounds and baking it at different temper¬ 
atures. Let temperature be factor A with three levels, say a\ = 400°. 02 = 500°, <23 = 
600°, and let factor C denote chemical compounds with levels ci, C 2 , C 3 , C 4 , say, each 
being a specified chemical compound. We have three furnaces available. Each fur¬ 
nace will be set at one of the randomly assigned temperatures. In each furnace we 
then place four dinner plates each individually produced using a different (randomly 
assigned) chemical compound for each plate. This process is repeated on several days. 
For each plate the breaking strength is then determined using a suitable machine. 

The large EUs are furnaces and the smaller EUs are the dinner plates: 


^3 


Day 1 

C 3 

C4 


C2 

Cl 


ai 


Day 2 1 

c 4 

Cl 


C2 

C3 


Furnace 

ai 


Cl 

C 4 

C2 

C 3 


«3 


C2 

Cl 

C 4 

C3 


d2 


C3 

Cl 

C2 

c 4 


«2 


c 4 

C 2 

C3 

Cl 
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etc. 
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If we repeat this process on r days we have r replications of each temperature, but 3r 
replications for each chemical compound. □ 

Not only will this type of arrangement lead to different precisions for the compar¬ 
isons among the levels of the A-factor and among those of the C-factor, but the fact 
that the two factors are associated with different types of EUs leads to different experi¬ 
mental error variances associated with these comparisons. This is the reason why these 
types of experiments must be distinguished very carefully from the factorial experi¬ 
ments described in Chapter 11 (from a purely technical point of view there exists a link 
between these two types of experiments through the notion of interblock information, 
which is discussed in Chapters II.7 - 11). 

We shall now describe some specific forms of designs which use different types of 
experimental units. 

13.2 SIMPLE SPLIT-PLOT DESIGN 

This design was developed and used first and foremost for agricultural, mainly agro¬ 
nomic experiments (see Yates, 1935 and 1937), but its applicability goes now across 
all fields of experimental research. Even so, the terminology for this design still makes 
references to plots of various types, but the reader should have no difficulty translating 
this into any other subject matter area. 

13.2.1 Superimposing Two Randomized Complete Block Designs 

We have two treatment factors A and B, with levels ai, < 22 , .. a a and &i ， 62 , ...，〜， 
respectively. Factor A is referred to as the whole-plot factor and the EUs to which the 
levels of A are applied are the whole-plots. Factor B is the split-plot factor and the 
EUs to which the levels of B are applied are the split-plots ， each whole-plot having b 
split-plots as illustrated below for 6 = 4: 

whole-plot —> 卞 

split-plot 

A replicate consists then of one application of each level ai. a 2 ,... .a a and within each 
of the a whole-plots of one application of each level bi, 62 , • • • *, h ， And the design 
consists then of r such replications. 

It is useful to think of this arrangement as superimposing one RCBD on top of an¬ 
other RCBD. For the first RCBD, involving the whole-plots and the whole-plot factor, 
we have 

RCBDj : t = a. number of blocks = r 

and for the second RCBD, involving the split-plots and the split-plot factor, we have 


RCBDb : t ~b. number of blocks = ra 


This brings out the fact that two independent randomizations are being used. 
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This structure suggests the partitioning of the rab — 1 d.f. available from the rab 
observations in the following way. If we consider first the RCBD^ we have the parti¬ 
tion 

Source d.f. 

Blocks (whole plots) ra - 1 

5-factor b - 1 


Residual (B) 

(ra — 1)(6 - 1) 

Total 

rab — 1 

We realize, however, that the systematic differences among blocks in a replicate are 
due only to the different levels of factor A. This and the fact that the replicates form 
the blocks for the RCBDa implies that we have the following partition of the ra — 1 
d.f. for whole-plots 

Source 

d.f. 

Replicates 

r — 1 

A-factor 

a — 1 

Error (^4) 

(r - l)(a-l) 

Whole-plots 

ra — 1 

It follows from this partitioning that the (rt 
partitioned further into 

2 — 1)(6 — 1) d.f. for Residual (B) can be 


Source 

d.f. 

Replicates x B 

(r- l)(fe- 1) 

Ax B 

(a - 1)(6- 1) 

Ettot(A) x B 

(r-l)(a-l)(6-l) 

Residual (B) 

(ra — 1)(6 — 1) 


Assuming no replicate x B interaction (since we are assuming unit-treatment addi¬ 
tivity), we then have the complete partitioning of the d.f. as given in the ANOVA 
of Table 13.1. The associated sums of squares and their properties can be derived as 
follows, based on the observations arising from the rab split-plots. 
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13.2.2 Derived Linear Model 

We denote the conceptual response of the t，th split-plot in the uth whole-plot of the 
zth replicate to which the jth level of the whole-plot factor A and the kth level of the 
split-plot factor B have been applied by x iuv jk. Assuming unit-treatment additivity we 
write 

^iuvjk = Uiuv + Tjk ， (13.1) 

where Ui UV is the unit contribution and Tj^ is the treatment contribution. We then write 
further, using obvious notation, 

Uiuv = U... + {Ui.. - f/...) + (Uiu. - Ui.) + (Ui UV - Uiu) (13.2) 
and 

T jk = T. + (Tj, - T.) + (T k - fj + (T jk ^ fj. - T k + fj. (13.3) 
Substituting (13.2) and (13.3) into (13.1) and defining 


Ui..~U...=r 7 


the effect of the zth replicate, 

Tj. - ?• = a i 

the effect of the jth level of A, 

f k - T. = 0 k 

the effect of the kth level of B, 


Tjk - fj. — T.k + T. = {oi(3)jk 


the interaction effect between the jth level of A and the kth level of B, we obtain 

Xi UV jk = /x 十 n + {Ui U , ~ Ui,,) + 0k + {oi0)jk 4 - {Ui UV — Ui U ). (13.4) 

We actually observe yijk ，the response of the (jk) treatment combination in replicate 
i. The observed and conceptual responses are linked to each other by two design ran¬ 
dom variables associated with the randomization processes of factors A and B, respec¬ 
tively. Let 



if level j of factor A is applied to the uth whole-plot in replicate i 
otherwise 


and 

s iL = 


if level k of factor B is applied to the ^th split-plot in the uth 
whole-plot of replicate i given that the jth level of A has been applied to 
that whole-plot 


otherwise. 
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Then ^ 

Vijk = ^iu f ^iuv X iuvjk j - (13.5) 

u=l \r=l / 

Substituting (13.4) into (13.5) we obtain 

Vijk ~ Vi - {- CXj 4~ 7Jij + + + C,ij fc, (13.6) 

where 

% = S >' L (心.-良 .） 

u 

and 

Cm = - u lu .). 

u,v 

The r]ij and Qjk are the two errors arising from the fact that we have two types of EUs 
and two independent randomization processes. Following arguments similar to those 
given for the RCBD (Chapter 9) we can derive easily the distributional properties of 
these errors. For example, it is obvious that 

ER{rfij) = 0 , E R (Cijk) = 0 

and that the rjij's are correlated and the Qjk's are correlated. Both types of errors con¬ 
stitute only unit errors to which we may add the technical errors, in this case treatment 
errors for factors A and B, respectively, and observational error. We indicate this by 
rewriting (13.6) to obtain the final model 

Vijk = M + r i + + (a0)jk efj k . (13.7) 

For all purposes of inference about the treatment effects we may treat the efj and ef^ k 
as if they were i.i.d. with means 0 and variances a\ A and respectively. This leads 
to the E(MS) in Table 13.1. " 

13.2.3 Testing of Hypotheses 

The forms of the 五 (MS) indicate the appropriate tests of significance. Relying on the 
approximation of the randomization test by the F -test we test 

(i) Ho ： ai = = a a = 0 by 

_ MS ⑷ _ 

’ = MS(E a ) 〜 

(ii) Ho：/?i =/3 2 - ••■ = A = 0by 

MS(S)_ 

~ MS{E B ) ^ b-Ur-l)a(b-l), 

(iii) H 0 ： all {a0) jk = 0 by 

MS(^ x B) 

MS(£^) 〜 f (a-l)(b—l) ， (r-l)a(6-1) • 
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13.2.4 Estimating Treatment Contrasts 

Since the split-plot design is an orthogonal design, treatment contrasts are estimated 
simply by the corresponding contrasts of appropriate treatment means and their vari¬ 
ances are obtained by using model (13.7) and its properties: 


(i) The contrast EjCjOij (EjCj = 0) among whole-plot treatment effects is esti¬ 
mated by EjCjy.j,. Since 

va 柄 .)= ^{a 2 eB + b(j 2 eA ) 
for every i = 1,2, … ， r; j = 1,2,.... a, we have 


var 


曲 = 


1b + hcr lA 


rb 


and hence, from Table 13.1 ， 


var 


^2 响 = 


MS{E a ) 

rb 


(13.8) 


(13.9) 


(ii) The contrast (^kdk = 0) among the split-plot treatment effects is esti¬ 

mated by 嫌 Now 


^ky..k — — ^ dk ^2 Vijk 


h^ dk 


rap + ^2etj+ ra0 k + 


^ijk 




h3 


^2 dk3k + ~ 13 dke ijk 


i ， j,k 


and hence 


and 



(13.10) 

(13.11) 


(iii) We now consider a contrast among split-plot effects averaged over a set of p(p < 
a) whole-plot treatments. If we write 


Tjk = Pk 


( 13 . 12 ) 
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and 



这 Tjk 


(to simplify notation we have averaged over the first p whole-plot treatments), 
we then consider with Edfc = 0. The estimator for this contrast is 


工 4 




rp 




》: Vijk 




rp 




rpn 


+ r J2 a j+J2 e i L j + rpi3k + r 乞 ㈣ hk + X I 




%3 






a + ~ y^( Q ^)jfc 

3 


i,3-k 


with 


and 


^(E rf 4 p) ) - I>- a + ^D ⑽) 开 

\ k ) k L P ^ 

var^4,4 P)> ) - (13.13) 

\ k / P k 





(13.14) 


(iv) We may also consider a contrast among whole-plot treatment effects averaged 
over a set ofq(q< b) split-plot treatments. If we write, using (13.12), 


n 



(to simplify notation we have averaged over the first q split-plot treatments), we 
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are then interested in with SjC 7 - = 0. The estimator for this contrast is 

C j yijk 
i,k 

1 q q 

+ +( i ， l2 e fj +r J2^+ r Yl ㈣ h + Y, e ijk 

j i k=l k=l i,k 

二 12 明 + lJ2 c J e fj + JlZ c j( a ^)jk + 

j i,j j，fc ^ i,j,k 

with 

E ^ a ! ?) j = H ° 2 j a i + - 舛 

var ^c,aj 9) j - ^^.4 + ^ ^ ( 13 . 15 ) 

We know from Table 13.1 that 

a； B = MS(E b ) 
and 

a e 2 ^ = i ： MS(E 4 )-MS(£B)] 

(if MS(£^ 5 ) > MS(Ea) we take d\ A — 0 ). 

We then estimate (13.15) as 

V&r (Z C i d J 9) j =^Y1 c2 MeA+^ 2 e B } 

= -Tc] \ q ^ EA lz MS ^M + ms(e b ) 

rq J [ 0 

= MS(£ a ) + ^^MS(Eb) . (13.18) 

r j ^ ' 

(v) Occasionally, a contrast of the form Tjk — 丁 yk'ij ★ J") is of interest. Obviously, 

于 jk — 子 j’k’ = y.jk — y.j'k' 

=aj - Oiy + - y^X e ti — e tj f ) + 0k - 0k' 

i 

+ i a P)jk - {oL0)j>k> + - y^S e Fik — e ij f k') 


(13.16) 

(13.17) 
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with 

vwc(fjk - Tyk') = ~<y 2 eA + ~ a eB (13.19) 

r r 

and, using (13.16) and (13.17), 

var(f^ - Ty k ，) = ^- b [MS(E A ) + (b - 1)MS (五 B )]. (13.20) 

13.2.5 Testing Hypotheses about Treatment Contrasts 

Of the contrasts described above, (i) and (ii) are usually of most interest. Tests of 
significance about or confidence intervals for them can be obtained by referring to t- 
statistics with the appropriate d.f., (r — l)(a — 1) for (i) and (r — l)a(b — 1) for (ii). 
We should mention, however, that inferences about the a/s or the (3k^ may not always 
be meaningful when the A x 5 interaction is significant. Careful examination of the 
kind of interaction present is necessary before proceeding to the inference about main 
effects. If such inferences are not appropriate contrasts of the form described in (iii) 
and (iv) may be more useful, often with p = q = 1. There is no difficulty in dealing 
with (iii) using the t-statistic with (r — l)a(b — 1) d.f. However, there does not exist 
an exact test for the contrast given in (iv). A reasonable method to use is to form the 
^-statistic in the usual way, that is, 


(13.21) 


and then compare (13.21) with the following weighted critical f-value: 

MS(£^4)t( r 一 i)( a _i) ， Q ； + 亡 ( r _i) a (b 一 1),0 ； 

t a = - , q _ --—— , (13.22) 

MS(£U) + — -MS(E b ) 

Q 

where t v ^ refers to the a-percentage point of the ^-distribution with v d.f. (for exam¬ 
ple, Cochran and Cox, 1957). 

Another method to use is that suggested by Satterthwaite (1947), that is, to compute 
(13,21) and approximate its distribution by that of a t-statistic with v d.f. where 

■ 7 ' 2 

MS{E a ) + ^^-MS{Eb) 

v =—^ --- ^~~2 • (13.23) 

[MS ㈣]' [ 宁轉 a) 

(r — l)(a — 1) (r — l)a(b — 1) 
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The same procedure can be used for inference about a contrast of type (v), using q = l. 
As mentioned earlier, this comparison is only occasionally of interest, for example to 
compare a control treatment with a particular treatment combination. 

We conclude this section by pointing out that the whole-plot treatments themselves 
can have a factorial structure and the same is possible for the split-plot treatments. The 
reader should have no difficulty modifying the ANOVA and hence making use of the 
factorial structures to analyze such an experiment. 


13.3 RELATIVE EFFICIENCY OF 
SPLIT-PLOT DESIGN 

Under most circumstances the split-plot design is used for purely technical and prac¬ 
tical reasons, as the levels of some factor can be applied only to large EUs which can 
then be “split” into smaller EUs for application of the levels of the other factor. This 
includes also the distinction between hard-to-change and easy-to-change factors in in¬ 
dustrial experimentation (see Section 12.4.6). It is, however, of interest to evaluate the 
efficiency of the split-plot design relative to the RCBD with r blocks. The question 
then is: Given that we have carried out a split-plot experiment, what would have been 
MS(-E) for the RCBD? This, of course, determines how much information would have 
been available for all treatment comparisons. We see from Table 13.2, using a unifor¬ 
mity trial for both situations, that is, pooling treatment sums of squares with appropriate 
error sums of squares, that 


r(ab- l)MS(E) = r(a — 1)MS(£U) + ra(b — 1)MS(E B ) 


and hence 


MS ⑻ = 


(a- 1)MS ㈣ + a(6 - 1)MS(E B ) 
ab — 1 


(13.24) 


The information on all treatment comparisons in a RCBD would then have been pro¬ 
portional to 1/MS(E), whereas in the split-plot design information on whole-plot treat¬ 
ment comparisons is proportional to 1/MS(Ea) and on split-plot treatment compar¬ 
isons and interaction proportional to 1 /MS(_E_b). Since MS(E) is a weighted average 
of MS(Ea) and MS(Eb), and since MS(_Ea) is usually greater than MS(Eb) (ex¬ 
cept for sampling errors), MS(E) will be intermediate in size between MS(Ea) and 
MS{E b ). — 

We can then state the results concerning relative efficiencies of the split-plot design 
versus the RCBD as follows: For A-factor comparisons we have 


EREj (Split-plot design vs. RCBD)= 


MS(E) 

MS(E a ) 


< 1 


(13.25) 


and for B-factor and AxB comparisons we have 
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Table 13.2 ANOVA for Uniformity Trial 


(a) Split-plot design 

Source d.f. MS 


Replicates 
Error (A) 
Error (JB) 

r — 1 

r(a — 
ra(b - 

1) MS(Ea) 

-1) MS{E b ) 

Total 

rab — 

1 

(b) RCBD 
Source 

d.f. 

MS 

Blocks r 

Error r 

-1 

(ab — 1) 

MS ⑹ 

Total rab — 1 



EREs (Split-plot design vs. RCBD)= 


MS(E) 

MS(E b ) 


> 1 . 


(13.26) 


Results (13.25) and (13.26) express the obvious: Although the average information 
is the same for both designs, the information on whole-plot treatment comparisons is 
less s precise in the split-plot design than in the RCBD, whereas the opposite is true 
for split-plot treatment and interaction comparisons. Hence, unless practical reasons 
dictate the use of a split-plot design or one is more interested in one factor than the 
other, use of a RCBD seems preferable. 

13.4 OTHER FORMS OF 
SPLIT-PLOT DESIGNS 

We mentioned in Section 13.2 that in order to better understand the structure of the 
simple split-plot design it is advantageous to view it as superimposing one RCBD (for 
the split-plot treatments) on top of another RCBD (for the whole-plot treatments). We 
shall refer to this as a SPD(RCBD, RCBD). Variations of this form of split-plot design 
are possible by using different component designs, other than both RCBD. Some useful 
combinations are indicated below (where IBD refers to incomplete block design) and 
discussed in the section indicated. 
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Error-control design for Error-control design for 


whole-plot treatment 

split-plot treatment 

Section 

CRD 

RCBD 

13.4.1 

CRD 

LSD 

13.4.3 

LSD 

RCBD 

13.4.4 

CRD 

IBD 

13.4.5 

GRBD 

RCBD 

13,4.6 

GRBD 

IBD 

13.4.7 

IBD 

RCBD 

13.4.8 

RCBD 

GRBD 

13.4.9 


13.4.1 SPD(CRD, RCBD) 

Each level of the A-factor is randomly assigned to r whole-plots and within each 
whole-plot the b levels of the B-factor are randomly applied to the split-plots. In this 
situation the whole-plots are often subjects. Each subject is given a certain treatment, 
that is, one of the levels of the whole-plot factor (^-factor) such that each level of the 
^-factor is applied at random to r subjects. Then each subject will receive, in random 
order, sequentially all b levels of the S-factor. It is for this reason that this type of de¬ 
sign is often referred to as a between-and-within-subjects design, where the ^4-factor is 
referred to as the between-subjects factor and the 5-factor is referred to as the within- 
subjects factor. A diagram of the structure of the data from such an experiment is given 
in Figure 13,1 (ignoring randomization). 

A suitable model is of the form 

Vijk = //• -f OLi + efj + /3/c + {o ： p)ik + efj k (13.27) 

or 

Vijk = " + 叫 + + (a,3)ik -h efj k . 

where s 勿 represents the effect of the jth subject receiving the ? th level of the 乂 -factor, 
(z = 1, 2,...,a; j = 1.2,... ,r; /c = 1,2,.... 6). The ANOVA is given in Table 13.3. 

13.4.2 Split-Plot Design in Time 

In some types of experiments subjects (EUs) are given a certain treatment, a dietary 
regimen for example. Observations (say weight) are then made at specified times (for 
instance, every month for one year). The design for such an experiment is usually of the 
form of an SPD(RCBD, RCBD) or SPD(CRD ， RCBD) and is, therefore, often referred 
to as a “split-plot design in time，’’ where the treatment is the ^-factor and the times 
are considered to be the “levels of the B-factor.” There are a number of problems with 
this viewpoint. First, the “J5-levels” are obviously not randomized. Secondly, and even 
more importantly, there exists a covariance structure for the observations and hence 
for the errors other than the one ordinarily induced by the randomization procedure. 
This may invalidate the analysis outlined above, and only if the covariance structure 
satisfies the Huynh-Feldt conditions (Huynh and Feldt, 1970) do MS(B)/MS(Eb) 
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Figure 13.1 Between-and-within-subjects design. 


Table 13.3 ANOVA for SPD(CRD, RCBD) 


Source 

d.f. 

SS 

E(MS) 

A-factor 

a — 1 

rbJ2(Vi.. - V...) 2 

i 

^Ib + b<j2 eA + - 1) 

Error (A) 

a(r — 1) 

bTXVij. - Vi..) 2 
ij 

十 b ^A 

S-factor 

6-1 

raYl{y..k - y...) 2 

k 

<^Ib 1) 

Ax B 

(a-1)(6-1) 

r J2(vi k - Vi.. - y..k + y...) 2 

i，fc 

g \b + r Yl( 0c ^)'ikj/[( a - l) • {b - 1)] 

Error (B) 

a(r — 1)(6 — 1) 

YKVijk — Vij. — Vi.k + Vi..) 2 

ijk 

-e 2 B 

Total 

rab — 1 

12 {ytjk — y...) 2 
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Figure 13.2 Schematic representation of a SPD(CRD ， LSD) with a = 2, 6 = 4. 

and MS(A x B)/MS(Eb) have F-distributions. These designs are called repeated 
measures designs (for a discussion see Chapter 14). 

13.4.3 SPD(CRD, LSD) 

Experiments in psychology and human factors engineering are often performed using 
this kind of design or variations of it. Rather than assign the levels of the split-plot 
factor randomly to the split-plots within each whole-plot as in the SPD(CRD, RCBD), 
they are assigned according to a Latin square design as follows. First, each level of the 
>1-factor is randomly assigned to r = b whole-plots. If it is suspected that the order 
of application of the ^-levels within each whole-plot has a systematic effect on the 
outcome then a Latin square arrangement of the following type may be used. For each 
A-level the b whole-plots form the rows of an LSD and the orders of application form 
the columns of the LSD. For each A-level we thus have an LSD of size b and the b 2 
row-column combinations represent the split-plots to which the 5-levels are assigned 
according to a randomly selected b x b LSD. For a = 2 and 6 = 4 the design can be 
represented as in Figure 13.2. 

The LSDs given in Figure 13.2 are actually of a specific type. They are sometimes 
referred to as completely counter-balanced or diagram-balanced (Wagenaar, 1969) 
and are constructed following a method due to Williams (1949) (see Section 10.7.2). 
The special feature of such an LSD is that each treatment precedes and follows every 
other treatment exactly once in the order of application. This is useful when learning 
effects or carry-over effects are suspected. 

An example of the design described here might be a psychological experiment in 
which subjects (Sij) are given different types of training (education), represented by 
the A-levels, and following that each subject performs sequentially a number of tasks 
(tests), the same for each subject and represented by the B-levels. It is suspected that 
a learning effect takes place. This means that subjects may respond differently to the 
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Table 13.4 ANOVA for SPD(CRD ， LSD) 


Source 

d,f. 


ss 

E(MS) 

•A-factor 

a — 1 

b 2 [X 仏 … 

i 

— y...) 2 

^Ib + b(j2 eA + E Oi 2 J{a - 1) 

i 

Error (A) 

a(b - 1) 

- 

i，j 

-Vi...? 

^Ib + 

Order (0) 

6-1 

ab ^2(y..k. 
k 

— y..) 2 


S-factor 

b - 1 

i 

— y. .) 2 

^eB + ab H 3f/{b - 1) 
l 

Ax B 

(a - 1)(6- 1) 

i,l 

■ yi... — y...i + y... ) 2 

i,l 

Ax O 

(a-l)(b-l) 

bJ2(Vi.k_ - 

i,k 

- Vi... — y..k. + y...) 2 

^Ib + b E(a7)? fc /(a — 1)(6 - 1) 

i.k 

Error (B) 

a(b — 1)(6 — 2) 

Difference 



Total 

ab 2 — 1 

H (yijk(i) — y...) 2 
ijk(l) 



same task given at different times (order). 

An appropriate model for this design can be written as 

Uijk(i) = " + + e 合 + 7/c + _/?， + (a,3)u -f (cry)ifc + ^fjk(i)- (13.28) 

where jk represents the kth order effect (i = 1,2,..., a:j, k. I = 1,2,..., b). This 
leads to the ANOVA given in Table 13.4. Model (13.28) includes a term for factor 
A x order interaction, It reflects differences among the “learning curves” for 

the different levels of the A-factor. If such differences are assumed to not exist then 
SS(A x O) in Table 13.4 can be pooled with SS(Eb)> 

Just as the SPD(CRD, RCBD) the SPD(CRD, LSD) is sometimes also referred to as 
a between-and-within-subjects design or mixed factorial design (for example, Keppel 
and Zedeck, 1989), where the A-factor is the between-subjects factor and the 5-factor 
is the within-subjects factor. 

The SPD(CRD, LSD) bears a certain resemblance to the replicated LSDs except 
that we have here two different randomization procedures. As explained for the SPD(RCBD, 
RCBD) this leads to two error terms rather than one for the repeated LSD (see Sec¬ 
tion 10.3). 

13.4.4 SPD(LSD ， RCBD) 

The whole-plots are arranged in an a x a Latin square and the levels of the A-factor 
are assigned in accordance with the randomization procedures for the LSD (see Chap¬ 
ter 10). In each whole-plot the levels of the 5-factor are applied to the split-plots 
according to a RCBD. A suitable model is of the form 

Vijkl = M + Pi + Ij + 叫 + e ijk + 01 + ( a P)kl + e fjkl (13.29) 
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Table 13.5 ANOVA for SPD(LSD ， RCBD) 


Source 

d.f. 


SS 


E(MS) 

Rows 

a — 1 

ab Z(Vi... - y....' 

I 2 



Columns 

a — 1 

- y....'. 

» 2 



^4-factor 

a — 1 

- y.... 

) 2 


-r bo-eA ^ ab ^ a k^ a ~ ^ 

Error (A) 

(a - l)(a - 2) 

b T. (Vijk. - 5i... 

l J k 

a 2 E(5...Z - 

- y.j.. - y..k. + 2 沒 … 

•) 2 4 丑 

T ^lA 

S-factor 

6-1 

) 2 

+a 2 Z^f/(b~ 1) 

A X B 

(a - 1)(6 - 1) 

« - v..k. 

一 y...i + y ....) 2 

a2 eB 

^ a T.{ Q 3) 2 kl /(a - 1)(5 - 1) 

Error (B ) 

a(a — 1)(6 — 1) 

Difference 


a eB 


Total 

a^b — 1 

E^Vijki - y- -- - 

) 2 




(i.j,k m 1,2’ ...， a; Z = 1,2, ... . b). The ANOVA is given in Table 13.5. Where 
applicable this is rather effective in increasing the precision for whole-plot treatment 
comparisons (Yates, 1935). 

As an example for this design we can envision an agronomic experiment where the 
experimental material (for example, field) requires blocking in two directions (rows and 
columns). The ^4-factor may represent different soil treatments such as no-till, shallow 
plowing, deep plowing (a. = 3). The 丑 -factor could be different types of fertilizer. 
The layout for 6 = 2 is given in Figure 13.3. 

13.4.5 SPD(CRD, IBD) 

This design is useful if the number of split-plots in a whole-plot is less than 6, say 
K. A suitable arrangement might then be that the r replications for each whole-plot 
treatment form an IBD for the split-plot treatments, the IBD (apart from randomization) 
being the same for each level of the ^4-factor. For example, Robinson (1967) considers 
the specific case of a BIBD (b ， r. K, R; X), that is, each split-plot treatment occurs 
R(< r) times with each whole-plot treatment, and each pair of split-plot treatments 
occurs together A times with each whole-plot treatment. 

As an example consider a = 3,6 = 4. And suppose each whole-plot contains only 
K — 2 split-plots. Suppose further that each level of the A-factor is replicated r = 6 
times. We can then use the following BIBD (4, 6, 2, 3; 1) for the 5-factor: 

1112 2 3 
2 3 4 3 4 4 

(where each column represents a block) in such a way that the two treatments in a 
block are assigned randomly to the split-plots in a whole-plot and for each level of the 
A-factor the six blocks are assigned at random to the six replicates. The layout (apart 
from randomization) is given in Figure 13.4. 

The model for this design is the same as (13.27) except that not all combinations 
(ijk) occur, but the analysis becomes now more complicated since the ^4-factor and 
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Figure 13.3 Layout for SPD(LSD ， RCBD) for a = 3, 6 = 2. 


the 5-factor are no longer orthogonal to each other. Partial sums of squares have to 
be obtained using the methods of Chapter 9 and Chapter II.l. We shall not go into 
the details here, but refer to Robinson (1967). A sketch of the ANOVA is given in 
Table 13.6. : 

We mention here that the IBD as the split-plot design needs to be chosen carefully. 
By that we mean that each level of the B-factor must occur with each level of the A- 
factor. Otherwise we cannot estimate all the interaction terms, and hence the d.f. for 
A x B will be less than (a — 1)(6 — 1). 

13.4.6 SPD(GRBD ， RCBD) 

This design is similar to the SPD(CRD, RCBD) with v' replications of each of the 
whole plot factor levels in that each replicate constitutes a SPD(CRD, RCBD). The 
main advantage of this design is that we can now test for Rep x 乂 ， Rep x and Rep xAx 
B interaction，using the following model 

Vijki = p + A + 巧 + (ra)ij + efj k 

+ 沩 + + {r(3)ii + {rap)iji + ef jkl (13.30) 

(2 = 1 2, .. r; j 二 1 ， 2, •. . ， a; fc 二 1, 2, .. . ， r ’； Z = 1 ， 2, . • . ， 6). This is, of course, 
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Figure 13.4 Layout of SPD(CRD, IBD). 


Table 13.6 Outline of ANOVA for 
SPD(CRD, BIRD) 


Source 

d.f. 

A-factor 

a - 1 

Error ⑷ 

a(r — 1) 

■B-factor 

6-1 

Ax B 

(a - 1)(6 - 1) 

Error(B) 

a(Rb — b — r + 1) 

Total 

abR — 1 
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Table 13.7 Outline of ANOVA for 
SPD(GRBD, RCBD) 


Source 

d.f. 

Rep 

r — 1 

A 

a — 1 

Rep x A 

(a - 1 ) 0 - 1 ) 

Error(A) 

ar{r' - 1 ) 

B 

6-1 

Ax B 

(a - 1 )( 6 - 1 ) 

Rep x B 

(r - 1)(6- 1) 

Rep x Ax B 

(r-l)(a-1)(6-1) 

Error ( 丑） 

ar(r f — 1)(6 — 1) 

Total 

rr r ab — 1 


important if the replication factor is an intrinsic factor and if the type of interactions 
mentioned above are important. The outline of the ANOVA is given in Table 13.7. 

13.4.7 SPD(GRBD ， IBD) 

This design is similar to the SPD(CRD, IBD) in that each replicate constitutes a SPD 
(CRD, IBD). In each replicate each whole-plot treatment is applied to, say, r f whole- 
plots and superimposed upon these is then an IBD, for example, a BIBD or PBIBD, for 
the split-plot treatments. This, too, is a nonorthogonal design and sums of squares in the 
ANOVA must be obtained from first principles (see Chapter 4 and also Chapter II. 1). 
An outline of the ANOVA is given in Table 13.8 for the SPD(GRBD, BIBD( 6 , r\ K, 
R\ A)). 

A special application of a SPD(GRBD, IBD) arises, for example, when the 丑 -factor 
(that is, the split-plot factor) itself has a factorial structure and a system of confounding 
has to be used. To illustrate such a procedure we give a simple example. 

Suppose we have three levels ai. a 2,03 for the A-factor and a 2 3 factorial for the 
5-factor. Let us denote those factors by C with levels co, ci, D with levels do, <ii, and 
E with levels eo, Suppose now we have r' — 2 applications of each level of A in 
each replicate and we have whole-plots with only four split-plots. Using the methods 
discussed in Chapter 11 the procedure to use is quite straightforward, namely to con¬ 
found the 3-factor interaction CDE with whole-plots, assuming that this interaction is 
of less importance than main effects and 2-factor interactions. This leads (apart from 
randomization) to the following arrangement for one replicate: 
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Table 13.8 Outline of ANOVA for 
SPD(GRBD, BIBD) 


Source 

d.f. 

Replicates 

r — 1 

^-factor 

a — 1 

Error (A) 

(a — l)(r — 1) + ar(r’ _ 1) 

B-factor 

b-1 

AxB 

(a - 1)(6- 1) 

Error(B) 

a(rRb — b — rr f + 1) 

Total 

arbR — 1 


The ANOVA for this design is given in Table 13.9. This is an orthogonal design 
and hence all sums of squares are easily obtainable using the usual procedures. The 
important feature of this design is that the five d.f. among whole-plots within a replicate 
can be partitioned into d.f. for A, CDE (since it is confounded with whole plots), and 
A x CDE. This is an example of “recovery of interblock information,” a procedure 
discussed in Chapters II. 1 and 8-11. 


13.4.8 SPD(IBD, RCBD) 

This situation may arise in the following context: Suppose we want to investigate and 
compare different therapeutic treatments consisting of a combination of inoculation 
and ointment. We propose to use identical twins for this study. We have three different 
substances, say ai, a 2 , as, for the inoculation, each individual receiving one substance, 
and we have two ointments, say 61 ， 62 , each being applied to one arm of each individ¬ 
ual. In other words, the whole-plots are the individuals and the split-plots are the arms 
of each individual. Schematically the arrangement of the treatment combinations may 
be represented as follows: 




554 


CHAPTER 13. SPLIT-PLOT TYPE DESIGNS 


Arm 


Twin pair Individual Innoculate Left Right 


This basic pattern may be replicated r times, using proper randomization. The IBD 
used here is obviously a BIBD (3, 3, 2, 1; 1) or, for the entire experiment, a BIBD (3, 
3r, 2, 2r; r). The ANOVA for this design is outlined in Table 13.10. Again, this is a 
nonorthogonal design. 


Table 13.9 ANOVA for SPD(GRBD, IBD), 
Using a System of Confounding 


Source 

d.f. 

E(MS) 

Replicates 

r — 1 


A 

2 


CDE 

1 


A x CDE 

2 


Error (A) 

5(r — 1) 

<^eB + rry lA 

C,D,E \ 

cd,ce,de] 

6 


AxC.AxD.AxEA 
A x CD, Ax CE, l 

Ax DE j 

12 


Error ⑼ 

18(r- 1) 

g Ib 

Total 

24r- 1 



13.4.9 SPD(RCBD ， GRBD) 

If the whole-plots can be divided into more than B sub-plots then the design for the 
split-plot factor B may be a GRBD with 〆 replicates for each of its b levels, or some 
form of extended block design as discussed in Section 9.8.5 


2 1112 1 
To To To To To To 

1 2 2 2 1 2 
To lo 6 6 6 6 

2 113 3 2 

a a a a a a 

i ―- 2 ^ —- 2 11 2 


12 3 
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Table 13.10 Outline of ANOVAfor SPD(BIBD, 
RCBD) using BIRD (3,3r, 2, 2r; r) 


Source 

d.f. 

Replicates 

r — 1 

Pairs/replicates 

2r 

A-factor 

2 

Error ⑷ 

6 r — 3 — 3r + 1 = 3r — 2 

B-factor 

1 

Ax B 

2 

Error ⑻ 

(6r - 1) - 2 = 3(2r - 1) 

Total 

12r- 1 


An outline of the ANOVAfor the SPD(RCBD, GRBD) is given in Table 13.11. We 
have included here RepxB and Repx^4 x B as separate sources of variation. They 
may, of course, be pooled with the Error(B) if these interactions are considered to be 
negligible. 

13.4.10 Summary 

The designs given in this section represent obviously only a few examples of different 
forms of split-plot designs. The reader should have no difficulty thinking of other 
examples or of considering the examples given above more generally. The important 
point is that it is useful to represent split-plot designs as superimposing two suitable 
error-reduction designs. Those component designs should be chosen to best suit the 
experimental situation present. 

13.5 SPLIT-BLOCK DESIGN 

Unfortunately, the terminology for error-reduction designs using the split-unit principle 
is not quite uniform. The design we shall discuss now is known as a split-block design 
and also as a split-plot design in strips. It represents a variation of the simple split-plot 
design discussed in Section 13.2. 

13.5.1 The Layout 

The basic difference, and it is an important one, between the simple split-plot design 
and the split-block design is the way in which the levels of the two treatment factors 
are assigned to EUs. In this case both factors are applied to whole-plots which are 
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Table 13.11 Outline of ANOVA for 
SPD(RCBD, GRBD) 


Source 

d.f. 

Rep 

r — 1 

A 

a~ 1 

Error(A) 

(r - 1 )(q - 1 ) 

B 

b~l 

Ax B 

(a - 1)(6- 1) 

RepxB 

(r - l)(b- 1) 

RepxA x B 

(r~ l)(a- 1 )( 6 - 1 ) 

Error(B) 

rab{r f — 1) 


Total rar’b — 1 


“orthogonal” to each other. Schematically，this can be represented as follows, for factor 
A with levels ai,a- 2 ,..., a Q (a = 8 ) and factor B with levels &i, 62 ,..., bb{b = 5): 


0.4 


a 8 


o-i 


a L 


0^2 




a-, 


An example of such an arrangement, where the levels of both factors are applied ran¬ 
domly to two types of whole-plots and the observations are obtained on the split-plots 
(determined by the intersection of the whole-plots) is the following agronomic exper¬ 
iment. We want to compare the yield of a certain crop under different systems of soil 
preparation and different density of seeding. Both operations (tilling and seeding) are 
done mechanically and it is impossible to perform both on small pieces of land. The 
arrangement shown above is then replicated r times, each time using different random¬ 
izations for A and B. 
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13.5.2 Linear Model and ANOVA 

It is clear from our earlier discussion and from the nature of this arrangement, that we 
should have a separate error variance for comparisons among the levels of the .4-factor, 
for comparisons among the levels of the 5-factor, and for interaction comparisons. A 
model reflecting this structure is of the form 

Uijk = M + + a j + e tj + 0k + eE + ( a P)jk + e fjk (13.3 1) 

with i = 1 , 2 ,..., = 1 , 2 ... .. a; fc = 1,2,..., 6 and the eg, e 黑 ， and can be 

considered as i.i.d. random variables with means 0 and variances o\ A . and cr^AB^ 
respectively. The ANOVA for this model is given in Table 13.12. 


13.5.3 Estimating Treatment Contrasts 

The ANOVA table suggests immediately how tests of hypotheses can be performed, us¬ 
ing different error terms for tests about main effects and the interaction. Different error 
terms are also involved in obtaining the variances of estimable functions involving dif¬ 
ferent kinds of treatment effects. We shall outline this briefly for the same comparisons 
discussed in Section 13.2: 


(i) Y^jCjaj is estimated by 

j 3 


rb 


5 : Uijk 


rb/d + b ^2 r i + rbaj 十 b 


• ^(a/3)jfe + e m 


i,k 


H c i Q i + ~ ^2 c j e t> + 7h 12 c ^ e tik 


r tr … rh ,, k 


with 


3 


var I ] = - r Y, c2 j a2 eA^- h 

j 

= ^ X] ^ b<J ^ A + - 

j 

It follows then from Table 13.12 that 

E c A) = AEs 2 ms ㈣ 


var 


i,k 


(13,32) 


(13.33) 
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(ii) ^kdkPk is estimated by T, k d k y..k with 


var V d k 0 h 


dli^eB + g \ab\ 


(13.34) 


following the arguments given for (13.32). It follows, again from Table 13.12, that 


var V ； 4/?/e 


YAms(e b ) 


(13.35) 


(iii) EjCja^ is estimated by 


~ = S Vi ^ k \ when 


r i p t 


j Li k 


d) 

k k=l / 


+ + H + ㈣ ) 开 + Z 

k i k k i k 

Y1 c i a i + \'}L c i e ^ 十 I 0/5)jfc 

j ij J k 

+ ~ Z] H C ^ e tlk 




~Yl c2 j a ^ +~Y1 Cj^AB 

j y j 

+ a eAB\- 


We know from Table 13.12 that 


: MS(E AB ) 


(13.36) 


a 2 eA ^-[MS(E A )~MS(E AB )} 
and hence we find the estimator for (13.36) to be 


E c A?) 卜 ⑽) + b 〒 MS(E AB ) 

< 3 I 0 


(13.37) 
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(iv) is estimated by ^ dk 




with 


var 




(q) 


rq 


^ Vi g \b + ^Iab] 


(13.38) 


and 


var 


㈣ =中 


MS(£ 3 ) + ^-MS(E ab ) 


(v) T jk - Tj'k'{j ^ ^ k f ) is estimated by y. jk - V.yk' with 


曹 (V.jk — y.j 1 ^) = -{^Ia + a eB + g \ab) 


and 


var(y. jfc - y^ k >) 


-MS{E A ) ^ ~MS{E B ) + ( 1 




(13.39) 


(13.40) 


MS(Eab) 
(13.4f) 


There can occur, obviously, also various forms of incomplete split-block designs. 
For example, we may have less than b column whole-plots. For a discussion of the 
analysis of such designs see Hering and Mejza (1997). 


13.6 SPLIT-SPLIT-PLOT DESIGN 

An extension of the simple split-plot design and its variations can be obtained by using 
the split-unit principle a second time, this time for the split-plots to obtain what are 
called split-split-plots\ 


whole-plot 

i a j) 


This allows us to accommodate a third factor C with levels (split-split-plot treatments) 
ci. C 2 ,..., c c . Using three independent randomizations we assign the whole-plot treat¬ 
ments (aj) to one whole-plot in each of r replicates, the split-plot treatments (bk) to 


split-split-plot 

⑹ 



split-plot 

(h) 
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Table 13.13 ANOVA for Split-Split-Plot Design 


Source d.f. MS 


E(MS) 


Replicates 

r — 1 

MS(R) 

A-factor 

a — l 

MS(A) 

Error (A) 

(r - l)(a - 1) 

MS(E A ) 

S-factor 

6-1 

MS(B) 

Ax B 

(a - 1)(6- 1) 

MS(AB) 

Error (B) 

(r — l)a(6 — 1) 

MS(E b ) 

C-factor 

c — 1 

MS(C) 

AxC 

(a - l)(c- 1) 

MS(AC) 

B xC 

(6- l)(c- 1) 

MS(BC) 

Ax B xC 

(a - 1)(6 - l)(c - 1) 

MS(ABC) 

Error (C) 

(r — l)ab(c — 1) 

MS(E C ) 


4c + C(7 eB + 6ccr L + rbc E ^/(a-l) 
4c： + CCr eS+ bccr eA 

^eC + C( 7 Ib + raC S !) 

^eC + C<T eB + rC S 一1)0-1) 

^c + C(J Ib 

a eC + 7?/(c - 1) 

cj\ c + rb E (a7)^/(a - l)(c- 1) 

令 + ra I ： (,/?7)L/(^- l)(c- 1) 

^eC + r E ( a - 1 )(卜 l)(c - 1) 


Total rabc — 1 


one split-plot within each whole-plot, and the split-split-plot treatments (ci) to one 
split-split-plot within each split-plot. The observation for a split-split-plot is then the 
observation for the treatment combination ajbkCi. 

Extending the arguments used in Section 13.2 an appropriate model for observa¬ 
tions from a split-split-plot experiment can be written as 

Vijki = M + ^ + efj + 0 k + (a0) jk + e^ k + 7 / 

+ + {Sl)ki + (oc0y)jki + (13.42) 

The error terms can be considered as i.i.d. random variables with means 0 and variances 
crg C , respectively. An outline of the ANOVA is given in Table 13.13. 

The form of the E(MS) in Table 13.13 indicates how tests of hypotheses should 
be performed using the three different error terms. The number of different types of 
treatment comparisons becomes now quite large. The estimators and variances of such 
comparisons can be worked out easily using methods similar to those given in Sec¬ 
tions 13.2 and 13.5. The estimated variances for some types of simple comparisons 
are given in Table 13.14. Here Tjki denotes the effect of the treatment combination 
a'jbkCi. Comparisons involving only the factors A and B are essentially as given in 
Section 13.2. For variances involving several MS the d.f. for a ^-test have to be esti¬ 
mated using Satterthwaited (1947) procedure (see also Section 13.2). 
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Table 13.14 Estimated Variances of Treatment Comparisons 
in a Split-Split-Plot Design 


Comparison 

var 

亍 .A — 亍 ..V 

2MS{E c )/rab 

亍 j，l 一 亍 j.l ， 

2MS(E c )/rb 

亍 M _ 亍 , kV 

2MS(Ec)/ra 

Tjkl _ 丁 jkl’ 

2MS(E c )/r 

f.ki - f.k'v {k k') 

2[(c - 1)MS(£ C ) + MS(E B )]/rac 

丁 jkl — 丁 jk’l 

2[(c — l)MS(£b) + MS ( 五 s)]/rc 

tj.i -亍 r .i' U ¥= f) 

2[(c- l)MS(£b) + MS(E A ))/rbc 

丁 jkl — Tj'kl 

2[b(c - l)MS(Ec) + {b- I)MS(Eb) + MS(E A )}/rbc 


13.7 EXAMPLES USING SAS® 


Example 13.2: We consider here the simple split-plot design, the SPD(RCBD, RCBD), 
with a — 3 whole-plot treatments, 6 = 2 split-plot treatments and r = 4 replicates. 
The data are given in Table 13.15a. 

To analyze the data we use both PROC GLM and PROC MIXED. The preferred 
procedure is PROC MIXED, but we include PROC GLM only for obtaining the ANOVA 
as given in Table 13.1. The input statements for both analyses are given in Table 13.15a: 

(i) The technical description for Error(A) is given by the (assumed to be negligible) 
interaction, rep* A 

(ii) In the GLM analysis we have to specify the correct test for A by the test state¬ 
ment. 

(iii) In the MIXED analysis the rep*A interaction is declared to be random, thus 
enabling the correct test for A. 

(iv) In PROC MIXED we choose the Satterthwaite procedure to determine the correct 
d.f. for testing various hypotheses. 

The results of both analyses are given in Table 13.15b: 

(v) In the type III ANOVA the P-values for rep, A, and rep*A should be ignored. 
The P-values for B and A* B are correct. The correct P-value for A (.0438) is 
given as a result of specifying the correct test (see (ii) above). 



13.7. EXAMPLES USING SAS® 


563 


(vi) Note that the d.f. for Error, our Error(B), are 9 and the d.f. for rep*v4, our 
Error(A), are 6. 

(vii) In the MIXED analysis the tests for A, B, and ^4 * 5 are performed correctly, 
that is, with the correct error terms and the correct d.f. The results agree with 
those obtained with GLM. 

(viii) The d.f. for the three treatment comparisons specified in the input statement 
are given as 9, 9, 9.24, respectively. This agrees with our discussion in Section 
13.2.5. □ 


Table 13.15 Split-Plot Design 


a) Input statements: 


data spltplot; 
input rep A B y 
datalines; 

1 1 1 56 1 1 2 41 1 2 1 50 1 2 2 36 1 3 1 39 1 3 2 35 

2 1 1 36 2 1 2 25 2 2 1 36 2 2 2 28 2 3 1 33 2 3 2 30 
311 32 312 24 32131322 27 33115332 19 
411 30 412 25 421 35 422 30 431 17 43218 

run; 

proc glmdata=spltplot; 
class rep A B; 

model y = rep A rep*A B A*B; 
test H=A E=rep*A; 
title 1 ’SPD(RCBD ， RCBD )’； 
title2 'BASIC ANOVA ，； 

run; 


proc mixed data=spltploi; 
class rep A B; 

model y = rep A B A*B/ddfm=S atterth; 

random rep^A; 

lsmeans A B A*B; 

contrast ’ (al+a2) vs a3’ A 1 1 -2; 

contrast ’bi-b2’ B 1 -1; 

estimate ’bl_b2’ B 1 -1; 

estimate ， albl-alb2’ B 1-1 A*B 1 -1 0 0 0 0; 

estimate ， albl-a2bl’ A 1 -10A*B 1 0-1 000; 

title2，ANOVA RESULTS AND POST-HOC ANALYSIS'; 

run; 

b) Output: 


SPD(RCBD, RCBD) 
BASIC AMOVA 

The GLM Procedure 

Class Level Information 


Class 


Levels 


Values 
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Table 13.15 (Continued) 

rep 4 1234 

A 3 12 3 

B 2 12 


Number of Observations Read 24 

Number of Observations Used 24 


Dependent Variable : y 





Sum of 





Source 


DF 

Squares 

Mean Square 

F 

Value 

Pr > F 

Model 


14 

2097.083333 

149.791667 


17.23 

<.0001 

Error 


9 

78.250000 

8.694444 




Corrected 

Total 

23 

2175.333333 






R-Square 

Coef f 

Var Root 

MSE y 

Mean 



0.964029 

9.460859 2.948634 31. : 

16667 


Source 


D? 

Type I S3 

Mean Square 

F 

Value 

Pr > F 

rep 


3 

1241.000000 

413.666667 


47.58 

<.0001 

A 


2 

353.083333 

176.541667 


20.31 

0.0005 

rep*A 


6 

192.250000 

32.041667 


3.69 

0.0394 

B 

AxB 


1 

216.C00000 

216.000000 


24.84 

0.0008 


2 

94.750000 

47.375000 


5.45 

0.0281 

Source 


DF 

Type III SS 

Mean Square 

F 

Value 

Pr > F 

rep 


3 

1241.00000C 

413.666667 


47.58 

〈 •C001 

A 


2 

353.083333 

176.541667 


20.31 

0.C005 

rep*A 


6 

192.25000C 

32.041667 


3.69 

0.0394 

B 

A*B 


1 

216.00000C 

216.000000 


24.84 

0.0008 


2 

34.750000 

47.375C00 


5.45 

0.0281 

Tests of 

Hypotheses 

Using the Type III MS 

for rep*A as 

an 

Error 

Tern 

Source 


DF 

Type III SS 

Mean Square 

F 

Value 

Pr > F 

A 


2 

353.0833333 

176.5416667 


5.51 

0.0438 


SPD(RCBD, RC3D) 

ANAOVA RESULTS AND POST-HOC ANALYSIS 
The Mixed Procedure 
Model Information 


Data Set 

Dependent Variable 
Covariance Structure 
Estimation Method 
Residual Variance Method 
Fixed Sffecrs SE Method 
Degress of Freedom Method 


WORK.SPLTPLOT 

Y 

Variance Components 

REML 

Profile 

Model-Based 

Satterthwaite 
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Table 13.15 {Continued) 

SPB(RCBD, RCBD) 

ANAOVA RESULTS AND POST-HOC ANALYSIS 


The Mixed Procedure 

Covariance Parameter 
Estimates 


Cov Parm 

rep* A 
Residual 


Estimate 

11.6736 

8.6944 


Fit Statistics 


-2 Res Log Likelihood 95.1 
AIC (smaller is better) 99.1 
AICC (smaller is better) 100.1 
BIC (smaller is better) 100.1 



Type 3 Tests 

of Fixed 

Effects 



Num 

Den 



Effect 

DF 

DF F 

Value 

Pr > F 

rep 

3 

6 

12.91 

0.0050 

A 

2 

6 

5.51 

0.C438 

B 

1 

9 

24.84 

0.0008 

A*B 

2 

9 

5.45 

0.0281 


Estimates 


Standard 


Label 

Estimate 

Error 

DF 

t Value 

Pr > ；ti 

bl 一 b 2 

6.0000 

1.2038 

9 

4.98 

0.0008 

albl-alb 2 

9.7500 

2.C850 

9 

4.68 

0.0012 

albl-a 2 bl 

0.5000 

3.1912 

9.24 

0.16 

0.8789 


Contrasts 



Num 

Den 


Label 

DF 

DF F Value 

Pr > F 

(al+a2) vs a3 

1 

6 10.99 

0.161 

bl-b 2 

1 

S 24.84 

0.0008 


SPD(RCBD 

, RCBD) 



ANAOVA RESULTS AND POST-HOC ANALYSIS 


The Mixed Procedure 
Least Square Means 
Standard 

Effect A B Estimate Error DF t value Pr > 

A 1 33.6250 2.0013 6 16.80 <• 


丨 2| 
0001 
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Table 13.15 (Continued) 


A 

2 


34.1250 

2.0013 

6 

17 

. 05 

<•0001 

A 

3 


25.7500 

2.0013 

6 

12 

.87 

<.0001 

3 


1 

34.1667 

1.3028 

9.24 

26 

• 23 

<.0001 

B 


2 

28.1667 

1.3028 

9.24 

21 

. 62 

<.0001 

A*B 

1 

1 

38.5000 

2.2565 

9.24 

17 

.06 

< . 0001 

A*B 

1 

2 

28.7500 

2.2565 

9.24 

12 

.74 

<.0001 

A*B 

2 

1 

3S.C000 

2.2565 

9.24 

16 

.84 

<.0001 

A*B 

2 

2 

30.2500 

2.2565 

9.24 

13 

.41 

<.0001 

A*E 

A*B 

3 

i 

26.0000 

2.2565 

9.24 

11 

.52 

<.0001 

3 

2 

25.5000 

2.2565 

9.24 

11 

.30 

<.0001 


Example 13.3: Consider the SPD(CRD, RCBD) or between and within subjects 
design with a = 3, 6 = 2 with unequal numbers of subjects (r\ = 4, r 2 = 3, = 2). 

The data are given in Table 13.16a together with the input statements for the analysis: 

(i) In the model statement we add the option “Satterth” in order to obtain the correct 
d.f. for treatment comparisons. 

(ii) Error(A) is specified as the random effect “subject(^4)”. 

(iii) Error(A) and Error(J5) both have 6 d.f. 

(iv) For the comparison u aib\ — the d.f. are computed according to (13.23) as 

7.99. □ 


Table 13.16 Between and Within Subjects Design 


a) Input statements: 


data spdcrd; 
input A subject B y 
datalines; 

1 1 1 25 1 1 228 1 2 1 27 1 22 31 

1 3 1 28 1 3 2 32 1 4 1 30 1 4 2 32 

2 5 1 30 2 5 2 35 2 6 1 33 2 6 2 39 

2 7 1 33 2 7 2 35 3 8 1 49 3 8 2 55 

3 9 1 49 3 9 2 54 

； run; 


proc mixed data=spdcrd; 
class subject A B; 

model y = A B A*B/ddfm=Satterth; 

random subject(A); 

lsmeans A B A*B; 

estimate ’al-a3’ A 1 0-1; 

estimate ， bl-b2’ B 1 -1; 

estimate , albl-alb2 , B 1 -1 A*B 1-10 0 00; 
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Table 13.16 (Continued) 

estimate ， albl-a3bl，A 10-1 A*B 1000-10; 

title 1 ’BETWEEN AND WITHIN SUBJECTS DESIGN ，； 

title2 'WITH UNEQUAL NUMBERS OF SUBJECTS ’； 

run; 

b) Output: 


BETWEEN AND WITHIN SUBJECTS DESIGN 
WITH UNEQUAL NUMBERS OF SUBJECTS 

The Mixed Procedure 

Model Information 


Data Set 

Dependent Variable 
Covariance Structure 
Estimation Method 
Residual Variance Method 
Fixed Effects SE Method 
Degrees of Freedom Method 


WORK.SPDCRD 

y 

Variance Components 

REML 

Profile 

Model-Based 

Satterthwaite 


Class Level Information 


Class 


Levels Values 


subject 

A 

B 


9 123456789 

3 12 3 

2 12 


Number of Observations 


Number of Observations Read 18 

Number of Observations Used 18 

Number of Observations Not Used 0 

Covariance Parameter 
Estimates 


Cov Parm Estimate 

subject(A) 2.4167 

Residual 0.9931 


Type 3 Tests of Fixed Effects 
Num Den 

Effect DF DF F Value Pr > F 


A 

2 

6 

119.29 

<.0001 

B 

1 

6 

79.56 

0.00C1 

A*3 

2 

6 

1.76 

0.2511 




Estimates 






Standard 




Label 

Estimate 

Error 

DF 

t Value 

Pr > |ti 

al-a3 

-22.6250 

1.4781 

6 

-15.31 

<.0001 

bl-b2 

-4.3611 

0.4889 

6 

-8.92 

0.0001 

albl-alb2 

-3.2500 

0.7046 

6 

-4.61 

0.0036 

albl-a3bl 

-21.5000 

1.5992 ' 

7.99 

-13.44 

<.00C1 
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Table 13.16 (Continued) 


Least Squares Means 


Effect 

A 

B 

Estimate 

Standard 

Error 

DF 

t Value 

Pr > |t1 

A 

1 


29.1250 

0.8534 

6 

34.13 

<•0001 

A 

2 


34.1667 

0.9854 

6 

34.67 

<.0001 

A 

3 


51.75C0 

1.2069 

6 

42.88 

<.0001 

B 


1 

36.1667 

0.6406 

7.99 

56.45 

<.0001 

B 


2 

40.5278 

0.6406 

7.99 

63.26 

<.0001 

A*3 

1 

1 

27.5000 

0.9233 

7.S9 

29.79 

<.0001 

A*3 

1 

2 

30.7500 

0.9233 

7.99 

33.31 

<.0C0I 

A*B 

2 

1 

32.0COO 

1.066 ： 

7.99 

30.02 

<.0001 

A*3 

2 

2 

36.3333 

1.0661 

7.99 

34.C8 

<.0001 

A*B 

3 

1 

49.0000 

1.3057 

7.99 

37.53 

<.0001 

A*B 

3 

2 

54.5000 

1.3057 

7.99 

41.74 

<.000 ： 


For the other split-plot designs mentioned in Section 13.4 we give below the input 
statements for PROC MIXED. 


SPD(CRD ， LSD) [Model (13.28)]: 


CLASS 

Subject Order A B\ 



MODEL 

Y ~ A Order B A * 

B 

A* Order/ddfm=satterth; 

RANDOM 

Subject (A); 



SPD(LSD, RCBD) [Model (13.29)]: 



CLASS 

Row Column A 

B., 


MODEL 

Y = Row Column A 

B 

A * _B/ddfm=satterth; 

RANDOM 

Row*Column*A; 



SPD(CRD, IBD): [Model (13.27)]: 




CLASS Subject A B\ 

MODEL Y = A B A * B/ddfm=satterth; 

RANDOM Subject(A); 
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SPD(GRBD ， RCBD) [Model (13.30)]: 


CLASS Block A Rep B; 

MODEL Y = Block A Block * A B A^B 

Block * B Block * ^4 * B/ddfm=satterth; 
RANDOM Rep(A * Block); 

where Rep refers to the replication of a treatment within a block. 
SPD(RCBD, GRBD) 


CLASS Rep A B\ 

MODEL Y = Rep A B A^B Rep * 5/ddfm=satterth; 
RANDOM Rep*A; 


SPLIT-BLOCK DESIGN [Model (13.31)]: 


CLASS Rep A B; 

MODEL Y = Rep A B B /ddfm=satterth; 

RANDOM Rep*A Rep*E; 

SPLIT-SPLIT-PLOT DESIGN [Model (13.42)]: 


CLASS Rep A B\ 

MODEL y = Rep A B A 木 B C A^C B^C 
B * C/ddfm=satterth; 

RANDOM Rep*A Rep*A * B\ 


13.8 EXERCISES 

13.1 Consider the SPD(RCBD, RCBD) with a levels for the whole-plot factor, b levels 
for the split-plot factor, r replications, and subsampling, that is, n observations 
for each split-plot. 
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(i) Write out an appropriate model for the observations. 

(ii) Write out the corresponding ANOVA table. 

(iii) Indicate how you would test hypotheses about the A-factor, the factor 
and the A x 5 interaction. 

(iv) Give the SAS commands for performing the analysis. 

(v) Using the notation of Section 13.2.4, obtain expressions for vhr(aj - A〆 ） 
and var(/3fc - /3fe0- 

13.2 Consider the SPD (CRD, RCBD) and suppose that the A-factor itself has a fac¬ 
torial structure, that is, the a levels of A are the a\ - combinations of the a\ 
levels of factor A\ and the a 2 levels of factor A 2 . Similarly, the b levels of the 
S-factor are the 61 • 62 combinations of the 61 levels of factor B\ and the 62 
levels of factor B 2 . 

(i) Write out a model for observations from this experiment. 

(ii) Write out the corresponding ANOVA table. 

(iii) Explain how you would test hypotheses about all main effects and interac¬ 
tions. 

(iv) Give the SAS commands for performing the analysis. 

13.3 Suppose that in Exercise 13.2 the A- and 召 -factors are 2 2 factorials with factors 

Ai,A 2 and respectively. 

(i) Give expressions for the estimates of the main effects Ai and A 2 , and for 
the interaction AiA 2 . 

(ii) Give expressions for var(Ai), var(^ 2 ), var(^i A 2 ) and for the estimators 
of these variances. 

(iii) Do the same for B 1 .B 2 , B 1 B 2 . 

(iv) Give an expression for the estimator for the interaction A\Bi, its variance, 
and the estimator for this variance. 

13.4 Consider an experiment where the amount of dry matter is measured on wheat 
plants grown in different levels of moisture and with different fertilizers (Mil- 
liken and Johnson, 1984). The experimental material consists of 60 peat pots 
and 15 plastic trays; four (4) peat pots can be put in one tray. The moisture treat¬ 
ment consists of adding 10, 20, 30,40, or 50 ml of water to the tray, where it will 
be absorbed by the pots. The experiment is being conducted at 3 different green¬ 
houses such that 5 trays are used in each greenhouse and in each greenhouse 
each moisture level is assigned randomly to one tray. The fertilizer treatments 
are represented by a 2 2 factorial of low and high levels of nitrogen and phos¬ 
phate. Each fertilizer combination is applied (at random) to individual pots in a 
tray such that each combination occurs once in each tray. In each pot 5 plants 
are grown, and observations are made on the individual plants. 
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(i) Give a schematic (that is, graphical) representation of the layout of the 
experiment. 

(ii) Give the name of the error-control design for this experiment. 

(iii) Give an appropriate linear model for the design described in (ii), which 
reflects the structures of the error-control design, the treatment design, and 
the sampling design. 

(iv) Outline the ANOVA table based on the model given in (iii), giving sources 
of variation, d.f.，and E(MS). 

(v) Explain how you would test whether there exists interaction between nitro¬ 
gen and the moisture treatment. 

(vi) The researcher is interested in finding out whether there exists a linear trend 
for the effect of moisture on dry matter. Give an expression for the estimate 
of the linear trend and give its standard error. 

13.5 Consider an experiment where the amount of dry matter is measured on wheat 
plants grown in different levels of moisture and with different fertilizers using a 
split-plot-type design (Milliken and Johnson, 1984).There are 48 different peat 
pots and 12 plastic trays; four (4) pots can be put in each tray. The moisture 
treatment consists of adding 40, 80 ， 120, or 160 ml of water to the tray, where 
it will be absorbed by the pots. The levels of moisture are assigned randomly to 
the trays such that each moisture level occurs 3 times. The fertilizer treatments 
are represented by all possible combinations of 0 and 1 unit of nitrogen, and 0 
and 1 unit of phosphate. The fertilizer is applied individually to each pot in a 
tray such that each combination occurs once in each tray. 

(i) What are 

(a) the whole-plots, 

(b) the split-plots, 

(c) the whole-plot treatment, 

(d) the split-plot treatment? 

(ii) What kind of split-plot-type design is this? Write out an appropriate linear 
model. 

(iii) Outline the ANOVA table in as much detail as possible based on the de¬ 
scription of the experiment (give sources of variation and d.f.). 

(iv) Explain how you would test whether nitrogen has an effect on dry matter. 

(v) The researcher is interested in finding out whether there exists a linear trend 
for the effect of moisture on dry matter. Give an expression for the estimate 

of the linear trend and its standard error (= square root of the estimated variance). 

13.6 Discuss the layout and analysis of a SPD (BIBD, RCBD) and describe a possible 
application for this design. 

13.7 Suppose a researcher comes to you to get some help on the analysis of the fol¬ 
lowing data set: 
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Factor B 


Factor A 

bi 


b3 

h 


X 

X 

X 

X 


X 

X 

X 

X 


X 

X 

X 

X 

a2 

X 

X 

X 

X 


X 

X 

X 

X 

^3 

X 

X 

X 

X 


where each x represents an observation. 

(i) What questions would you ask the investigator before you can analyze the 
data? 

(ii) Describe three scenarios (analogous to Study 2 in Section 2.6.2) which 
could have given rise to this data set. 

(iii) For each scenario write out an appropriate linear model and the correspond¬ 
ing ANOVA table. 

(iv) For each case explain how you would make statistical inferences about the 
main effects A and B and the interaction Ax B. 



CHAPTER 14 


Designs with Repeated 
Measures 


14.1 INTRODUCTION 


For the title of this chapter we have, quite deliberately, not chosen the phrases repeated 
measures designs or repeated measurement designs ， which, unfortunately, mean differ¬ 
ent things to different people. For Hedayat and Afsarinejad (1975), for example, they 
refer mainly to cross-over designs (see Section 10.7 and Chapter 11.19), whereas for 
Hand and Crowder (1996), for example, they refer to designs with longitudinal data; 
that is, measurements repeated over time. This is the point of view we take here, too. 
In that sense then this aspect of experimental design is not so much an aspect of error- 
control or treatment design even though they play a role, as we shall see, but mainly 
an aspect of the observation design. As such repeated measures can be associated with 
any of the error-control designs we have discussed in previous chapters, for example a 
CRD with repeated measures. 

We encounter repeated measures most often in medical, parmaceutical, agricultural 
or psychological applications, where it is intended to study the efficacy of treatments 
over a certain time period. 

Example 14.1: (Frison and Pocock, 1992): A randomized trial of 152 patients with 
coronary heart disease compared an active drug with a placebo with respect to a pos¬ 
sible adverse drug effect on the liver. The liver enzyme CPK was measured in each 
patient before treatment, at the time of randomization and every 1.5 months after treat¬ 
ment. □ 
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14.2 METHODS FOR ANALYZING 
REPEATED MEASURES DATA 

There exist several methods of analyzing such data. For an overview see Everitt (1995 )， 
and Keselman, Algina and Kowalchuk (2001). We shall mention here some methods 
and provide some more details in the following sections. 

In order to keep the discussion simple let us consider the situation where t treat¬ 
ments are applied randomly to r experimental units (for instance, patients, animals, 
pieces of land), and measurements are being taken on each EU at p times after ad¬ 
ministration of the treatment, say ti, t p . In some situations a measurement at 

or immediately preceding the time of administration, say to, may be taken. We, thus, 
have a CRD with p or p + 1 repeated measures. The time points may or may not be 
equidistant. The reader should have no difficulty extending the following discussion 
to other error-control designs. Also, Finney (1990) points out that repeated measures 
may not be confined to a temporal situation, but may involve also spatial situations 
as measurements are taken at different distances from the point of application of the 
treatment, for example, different depths of soil in a compaction study. 

14.2.1 Comparisons at Separate Time Points 

A commonly used approach is to consider the data at each time point arising from a 
separate “experiment”. Let us write a model for the observations as 

l/ijk = AHj’/c 十 ^ijk (14.1) 

with z = 1, 2 ,..j = 1 , 2,.. r; fc = 1, 2 , ..p and 

fHjk = " + D + Sij-^rTk + (jT)e ik (14.2) 

with T{ = 2 -th treatment effect 

Sij = effect of the j-th subject (EU) for i-th treatment 

Tk = fc-th time effect 

(rT)ik = treatment-time interaction effect. 

We should point out that model (14.2) is essentially equivalent to model (13.27) 
under the following correspondence: 



— 丁 i 

e A . 

u 

— S ij 

0k 

一 T k 


(TT^ik. 

^ijk 

— ^ijk 
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and the e 勿 have the following covariance structure. If we write 

Cij = Cij2, . . . ， ^ijp) (14.3) 

then 

and 

= 〉: = (14.4) 

for all z, j, and k, k f = 1,2, .. p. 

For each time k we then consider contrasts of the form c i Pd.k with ^2 Ci = 0, 

i 、 

by looking at the observations at each time point as arising from a CRD. We see from 
(14.2) that 

> : Q f^i'k = 〉: Q(Ti + + ( 丁 T'jik )， (14.5) 

i i 

that is, the contrasts at different time points are possibly different because of treatment- 
time interactions. This is, of course, the main reason for looking at different time points 
as we want to find out whether the treatments have different effects over time, and if 
so, when the differences appear first. 

A word of caution is in order here because the tests performed at each time point 
are not independent since the errors in (14.1) are now correlated. These correlations 
may become smaller as the time points are further apart. We may therefore choose time 
points which are not too “close” together, depending, of course, on the subject matter 
context. 

14.2.2 Use of Summary Measures 

Rather than performing several tests as described above, another approach may be to 
perform just one analysis based on a summary measure or performance feature for 
each subject over the entire set of time points. Such summary measures will have to 
be determined by the type of question we are investigating. For example, if we are in¬ 
terested in comparing the growth curve due to different treatments, then the area under 
the growth curve may be an appropriate summary measure. On other occasions the 
average response to treatment may be the most relevant summary measure (Matthews 
et al” 1990; Frison and Pocock, 1992). 

14.2.3 Trend Analysis 

In many situations it is important to detect trends over time or profiles or, perhaps even 
more importantly, to see whether the trends are the same for the different treatments. 
One way to approach these questions is as follows (see Rowell and Walters, 1976). 
Suppose we characterize the trends by a set of contrasts among the in model (14.1), 
denoted by q\T with 

ci = (cu,c 2 h - - - ,c p iY ( 14 . 6 ) 
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Table 14.1 ANOVA for Model (14.7) 


Source 

d.f. 

SS 

五 (MS) 

"r 

1 

trzf_. 

af + trfii 2 

7u 

t-1 

r J2( 5 li .- 2；..) 2 

i=l 

E7fi 2 

^ +r t-l 

Error/ 

t(r - 1) 

( z lij — 习 i.) 2 

i=l j=l 


Total/ 

tr 

E E 4 

i=\ j=l 



and 

= 0( / = L2,... ,q), 

and 

t 二 m …, T p y 

In many cases the time points will be equally spaced, for instance, in intervals of 15 
minutes, in which case it is useful and convenient to characterize the trend as a polyno¬ 
mial over time and take the c/ of (14.6) as the orthogonal polynomials of order p and 
degree l = (see Chapter 7). From (14.1) we then derive sequentially the 

models (that is, for l = 1,2...., q) 

z Uj = = ° klTk + C kl{^T) ik + ^2 Ckl^ijk 

k k k k 

=+ (14.7) 

with 

k 

lit = yZ C kl( rT )ik 

k 

e !ij = y^Ckie ijk 
k 


and Eie^) — 0, var(e z *^) = = erf say. Model (14.7 )leads to the ANOVA 

given in Table 14.1, 
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Hence, to test for an overall trend defined by (14.4), that is, Hq ： ^2 k CkiTk = fi* = 
0, we use the test 

r — ss("r) 

MS (Error/) 

with 1 and a(r — 1) d.f., and to test whether this trend is the same for all the treatments, 
that is, Ho : = 7/2 ~ ~ 7/V we use the test statistic 

r — MSh,*) 

MS (Error/) 

with a - 1 and a(r - 1) d.f. We perform this analysis for every / = 1,2,, q. These 
tests generally provide an informative picture about the behavior of the treatments over 
time. 


14.2.4 The ANOVA Method 

In Section 13.4.2 we have referred to the design with repeated measures as a split- 
plot design in time. We have pointed out, however, that there is no randomization for 
the 5-factor, that is, for time, and that the ef^ k = in model (13.27) and (14.1), 
respectively, have a correlation structure, which we now have acknowledged explicitly 
in (14.4). For these reasons the testing procedures derived from the ANOVA in Table 
13.3 may be invalid. 

However, if ^ in (14.4) satisfies the so-called Huynh-Feldt condition (Huynh and 
Feldt, 1970) given as 

= Ai p + 7 a ； + 3 p y ! (14.8) 

where 入 is a constant and 7 = (7i ， 72 ， • •. ， 7 P ) / is a vector of constants, then the 
usual F-test for testing Ho: J\ = 了 2 = :… =T p [model (13.27) and 14.1] is valid. 
The condition (14.8) which can be written alternatively as 

^kk' = AJ/c/c’ + ， 


where 5kk' = 1 if fc = fe’ and = 0 otherwise, contains as a special case a structure 
referred to as compound symmetry, characterized by 


f a 2 for k = k ( 
\ pa 2 for k k r 


(14.9) 


that is, equal variances and covariances for the 4). For the case of compound sym¬ 
metry (CS) for the in (14.3) Geisser and Greenhouse (1958) already proved that 
the usual analysis for the split-plot design is valid. The case we need to consider here 
then in connection with repeated measures designs is when neither (14.9) nor (14.8) 
are satisfied. 
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14.2.5 Mixed Model Analysis 

Recall that the data from a CRD with repeated measures (often also referred to as a 
between- and within-subjects design) are described by the model (see (14.1) ， (14.2 )， 
(14.4)) t ‘ ’ 

Vijk = M + Tfc + + (tT*)^ -f- (14.10) 

where 〆 + + (rT)^ is the fixed part and Sij is the random part of a mixed 

model. More specifically, concerning the random part, the sy are i.i.d. (0, random 
variables, and the have a covariance structure given by (14.4). As a consequence 
the variance-covariance matrix for the vector of n = trp observations, y, is given by 

var(y) = \ n a 2 s + I ir x ^ = V, (14.11) 

where “x” indicates a Kronecker product. 

It is V of (14.11) that we would need to use to estimate and make inference about 
the fixed effects in model (14.10) (see Sections 4.6.2 and 4.18). Unfortunately, we 
do not know the variance and covariance components in (14.11) and, indeed, we do 
not even know the covariance structure represented by ^2, Hence, in order to analyze 
repeated measures data we need to make an assumption about the structure ^2 an d then 
use a suitable estimation procedure to estimate the variance and covariance component 
to obtain say, and then solve the Aitken-like equations (4.80) using This is, 
generally speaking, not an easy task and for the average user possible only with the 
availability of suitable software, such as SAS PROC MIXED (SAS Institute, 2002- 
2003). 

We shall not go into the details of SAS PROC MIXED but mention the form of 
some of the possible covariance structures (using p = 4) that this program can use and 
that we consider to be relevant for this situation: 

Compound Symmetry (CS): 


a 2 

per 2 



9^ 

a 2 



〆 


a 2 




/9CT 2 

a 2 


that is, all the variances (diagonal) are the same, and all covariances (off-diagonal) are 
the same, regardless of the distance between time points (see (14.9)). 


First-order Autoregressive (AR(1)): 


P 

P 


P P 


P P P 


P 

P 
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that is, the variances are the same, and, since | p |< 1, the covariance diminishes as the 
time points become further apart. 

Unstructured (UN): 



^21 

^31 

CJ41 

<721 

^2 

732 

742 

^31 

(^32 

^3 

J43 

<J 4 i 

<742 

J43 

^4 


that is, all variances and all covariances are possibly different. 
Spatial Power (SP(POW)(C)): 


1 

p dl2 

p dl3 

p di4 

p d 21 

1 

/ 23 

p d2i 

p d31 

p d ^ 2 

1 

p ds4 

p d41 

p d42 

p d ^ 

1 


that is, just like AR(1) the correlations depend on the distance between time or spatial 
points, except that the power of p is now determined by a measure, dij, of the actual 
distance between points i and j.. 

As we have mentioned before, it is unlikely that the CS is appropriate for repeated 
measures data, but if this structure holds then the analysis is equivalent to the ANOVA 
given in Table 13.3. This is the reason why the CS structure is appealing and frequently 
used. Even though we do not go as far as Finney (1990) who says that it should never 
be used (unless p = 2), we caution the user to be very careful with its use. 

We prefer, in general, the AR(1) structure because it seems to reflect an intuitive 
amount of correlation between observations at different time points and to allow for the 
correlation to become smaller as the times of observation are farther apart. The same 
comments apply to SP(POW)(C), in particular if the distances between points are not 
the same. 

The safest assumption about is certainly UN. But the drawback is that it re¬ 
quires the estimation of many parameters in V which will make it not a very powerful 
procedure. 

SAS PROC MIXED allows for other covariance structures, but the ones mentioned 
above will generally suffice from a practical point of view, and we shall illustrate their 
use in Section 14.3. 

To conform somewhat to the SAS PROC MIXED notation we shall rewrite model 
(14.1) for the general situation (in matrix notation) as 


y = Xa + U/3 + e 


(14.12) 
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where Xa refers to the fixed part, and U/3 refers to the random part, with X and U 
known matrices and 


E{(3) = 0, var(/3) = G 

E(e) == 0, var(e) = R 

(14.13) 

so that (14.11) becomes 

var(y) = UGU' + R = V. 

(14.14) 


In our situation the fixed part of (14.12) represents parameters associated with the treat¬ 
ment, time, and treatment x time interaction effects as well as possibly some blocking 
factor effects for error-control designs other than the CRD. In addition, we may also 
have other treatment effects in connection with a real split-plot structure, which would 
then contribute further error terms to U/3. Thus model (14.12) represents the most 
general case. 

The fitting of model (14.12) in SAS PROC MIXED can be done by specifying one 
of several procedures. The default option is the residual maximum likelihood procedure 
(REML). For a description see Section II. 1.11.2. 

14.3 EXAMPLES USING SAS® 


Example 14.2: Consider an experiment comparing different drugs with respect to 
their efficacy to control the heart rate of certain patients. We have 亡 = 3 drugs, each 
drug being given to r = 5 patients, and the heart rate is measured at p = 4 different 
(equispaced) times. The data are given in Table 14.2a. 

We perform several analysis mainly to illustrate different procedures as described 
in Section 14.2.4 and 14.2.5 and show their similarities and differences. 

The SAS PROC GLM and MIXED for the ANOVA method and the mixed model 
analysis using the covariance structures CS, AR(1), and UN are given in Table 14.2a. 
In each case we perform the basic analysis. Only for the AR(1) method (which is our 
preferred method) do we follow up with a post-hoc analysis. The results for all analysis 
are given in Table 14.2b. 

We comment as follows: 

(i) For the ANOVA method we need to specify the correct error term for testing 
hypothes about drugs. This error term corresponds to error(A) in the SPD(CRD, 
RCBD) (see Section 13.4.1) and is given in technical terms by “person(drug)”. 

(ii) The results show that there are no significant differences among drugs (P = .29), 
but we shall return to this point later in light of the fact that there are significant 
changes over time (P < .0001) and, most importantly, significant interaction 
between drugs and time (P < .0001). 


(iii) For the CS method we obtain the same test results as mentioned in (ii) above. 
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Table 14.2 CRD with Repeated Measures 


a) Input statements: 
data heart; 

input drug person time rate 
datalines ； 

1 1 1 72 I 1 286 1 1 3 81 1 1 477 
1 2 J 78 f 2283 1 23 88 1 2 481 
1 3 1 71 1 3 282 1 3 3 81 1 34 75 

1 4 1 72 1 4 2 83 1 4 3 83 1 4 4 69 

1 5 1 66 1 5 2 79 1 5 3 77 1 5 4 66 

2 1 1 85 2 1 2 86 2 1 3 83 2 1 4 80 

221 82 222 86 223 80 224 84 

2 3 1 71 2 3 278 2 3 3 702 3 475 

24 1 83 242 88 243 79 244 81 

25 J 8625285253 7625476 

3 1 1 69 3 1 2 73 3 1 3 72 3 1 4 74 
321 66 322 62 323 67 324 73 
3 3 1 84 3 3 2 90 3 3 3 88 3 3 4 87 
34 1 80 34281 34 3 77 34 4 72 
3 5 1 72 3 5 2 72 3 5 3 69 3 5 4 70 

run; 

proc glm data=heart; 
class drug person time; 

model rate=drug person(drug) time drug*time/SS3; 
test h=drug e=person(drug); 
title 1 ’CRD WITH REPEATED MEASURES ’； 
title2 ’ANALYZED AS SPD(CRD ， RCBD )’ ； 

run; 

proc mixed data=heart; 

class drug person time; 

model rate=drug time drug*time; 

repeated/type=cs subject=person(drug) rcorr; 

title2 ’WITH COMPOUND SYMMETRY ’； 

run; 

proc mixed data=heart; 

class drug person time; 

model rate=drug time drug*time; 

repeated/type=ar(!) subject=person(drug) rcorr; 

estimate ， drugllin’ time -3-113 drug^time -3 -1 1 30000000 0; 

estimate ’drug21in’ time -3-113 drug*time 0 0 0 0-3-1 1 30000; 

estimate ’drug31in’ time -3-1 13 drug^time 0 0 0 0 0 0 0 0-3 -1 1 3; 

estimate 'drug 1 qua 5 time 1-1-11 drug*time 1 -1 -1 1 0 0 0 0 0 0 0 0; 

estimate ， drug2qua’ time 1-1-11 drug*time 0 0 0 0 1 -1-1 1 0 0 0 0; 

estimate ’drug3qua’ time 1-1-11 drug*time000000001 -1 -1 1; 

estimate 'drug 1 cub' time -13-3 1 drug*time -1 3-3 1 00000000; 

estimate ’drug2cub’ time -13-3 1 drug*time 0 0 0 0-1 3 -3 1 0000; 

estimate ’drug3cub’ time -13-3 1 drug*time 0 0 0 0 0 0 0 0-1 3-3 1; 

title2 9 WITH AUTOREGRESSIVE ERRORS ，； 

run; 
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Table 14.2 (Continued) 

proc mixed data=heart; 

class drug person time; 

model rate=drug time drug*time; 

repeated/type=un subject=person(drug) rcorr; 

title2 'WITH UNSPECIFIED CORRELATION STRUCTURE ’； 

run; 

proc sort data=heart; 
by drug time; 

proc means mean noprint; 
by drug time; 
var rate; 

output out=meandata mean=m_rate; 

title2 ’PLOT OF MEAN HEART RATE OVER TIME’ ； 

run; 

proc plot data=meandata; 
plot mjrate*time=drug; 
label m_rate=’Mean Heart Rate ’； 

run; 

quit; 

b.) Output: 


CRD WITH REPEATED MEASURES 
ANALYZED AS SPD(CRD^RCBD) 

The GLM Procedure 

Class Level Information 


Class 

Levels 

Values 

drug 

3 

1 

2 

3 

person 

5 

1 

2 

3 4 5 

time 

4 

1 

2 

3 4 


Number of Observations Read 60 

Number of Observations Used 60 


Dependent Variable : rate 


Source 

DF 

Sum of 
Squares 

Mean Square 

F Value 

Pr > F 

Model 

23 

2449.500000 

106.500000 

12.73 

<•0001 

Error 

36 

301.100C00 

8.363889 



Corrected Total 

59 

2750.600000 
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Table 14.2 (Continued) 

R-Square Coeff Var Root MSE rate Mean 

0.890533 3.722058 2.892039 77.70000 


Source 


DF Type III SS Mean Square F Value Pr > F 


drug 

person(drug) 
time 

drug*time 


2 337.600000 168.800000 20.18 <.0001 

12 1498.500000 124.875000 14.93 <.0001 

3 256.333333 85.444444 10.22 <.0001 

6 357.066667 59.511111 7.12 <.0001 


Tests of Hypotheses Using the Type III 
MS for person(drug) as an Error Term 

Source DF Type III SS Mean Square F Value Pr > F 

drug 2 337.6000000 168.8000000 . 1.35 0.2955 


The Mixed Procedure 
Model Information 

Data Set WORK.HEART 

Dependent Variable rate 

Covariance Structure Compound Symmetry 

Subject Effect person(drug) 

Estimation Method REML 

Residual Variance Method Profile 

Fixed Effects SE Method Model-Based 

Degrees of Freedom Method Between-Within 


Iteration History 

Iteration Evaluations -2 Res Log Like Criterion 

0 1 329.48905107 

1 1 289.92035887 0.00000000 


Convergence criteria met. 


CRD WITH REPEATED MEASURES 
WITH COMPOUND SYMMETRY 

The Mixed Procedure 

Estimated R Correlation Matrix for person(drug) 1 1 
Row Coll Col2 Col3 Col4 

1 1.0000 0.7769 0.7769 0,7769 

2 0.7769 1.0000 0.7769 0.7769 

3 0.7769 0.7769 1.0000 0.7769 

4 0.7769 0.7769 0.7769 1.0000 
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Table 14.2 (Continued) 


Covariance Parameter Estimates 


Cov Parm Subject 


Estimate 


CS person(drug) 

Residual 


29.1278 

8.3639 


Fit Statistics 


-2 Res Log Likelihood 289.9 


AIC 

(smaller 

is 

better) 


293.9 


AICC 

(smaller 

is better) 


294.2 


BIC 

(smaller 

is 

better) 


295.3 


Null Model 

Likelihood 

Ratio Test 



DF 

Chi-Square 

Pr > C 

: hiSq 



1 



39.57 

<. 

0001 


Type 

3 Tests 

of Fixed 

[Effects 




Num 


Den 




Effect 


DF 


DF 

F Value 

Pr 

> F 

drug 


2 


12 

1.35 

0 . 

2955 

time 


3 


36 

10.22 

<. 

0C01 

drug*time 


6 


36 

7.12 

<. 

OCOl 


CRD 

WITH 

REPEATED MEASURES 




WITH AUTOREGRESSIVE ERRORS 


The Mixed Procedure 
Model Information 


Data Set 

Dependent Variable 
Covariance Structure 
Subject Effect 
Estinazion Method 
Residual Variance Kerhod 
Fixed Effects SE Method 
Degrees of Freedom Method 


WORK.HEART 
rate 

Autoregressive 
person(drug) 
REM1 
Profile 
Model-Based 
Between-Within 


Iteration History 


Iteration 


Evaluations 


-2 Res Log Like 


Criterion 


329.48905107 

285.94895046 

285.94254325 

285.94253892 


0.00006372 
0.00000004 
0.00000000 


Convergence criteria met. 
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Estimates 

Standard 


Label 

Estimate 

Error 

DF 

drugllin 

4.S00C 

8.4206 

36 

drug21in 

-13.60CO 

8.4206 

36 

drug31ir. 

2.0000 

8.4206 

36 

druglqua 

-19.2000 

2.3C54 

36 

drug2qua 

-1. 6000 

2.3054 

36 

drug3qua 

-0.8000 

2.3054 

36 

drug1cub 

3.60CO 

4.0298 

36 

drug2cub 

18.8000 

4.0298 

36 

drug3cub 

4.0000 

4.0298 

36 


CRD WITH REPEATED MEASURES 
WITH UNSPECIFIED CORRELATION STRUCTURE 

The Mixed Procedure 


0.2707 
<.C001 
<•0001 


drug 

nine 

drug*time 


Table 14.2 (Continued) 


Estimated R Correlation Matrix for person(drug) 1 1 
Row Coll Coi2 Col3 Col4 

1 1.0000 C.8278 C.6852 0.5672 

2 0.8278 1.0000 0.8278 0.6852 

3 0.6852 0.8278 1.0000 0.8278 

4 0.5672 C.6852 0,8278 1.0000 


Covariance Parameter Estimates 

Cov Parm Subject Estimate 

AR(1) person(drug) 0.8278 

Residual 36.0107 


Fit Statistics 


-2 Res Log Likelihood 285.9 
AIC (smaller is better) 289.9 
AICC (smaller is better) 290.2 
BIC (smaller is better) 291.4 


Null Model Likelihood Ratio Test 
DF Chi-Square Fr > ChiSq 

1 43.55 <.0001 

Type 3 Tests of Fixed Effects 
Nurr. Den 

Effect DF DF F Value Pr > F 


Pr > ItI 

0.5722 
0 ■ 1150 
0.8136 
<.0001 
0.4921 
0.73G6 
0.3776 
<.0001 
0.3275 


7 2 4 3 9 5 9 7 9 
5 6 2 3 6 3 8 6 9 

0108000 4C 

- III 
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Table 14.2 {Continued) 


Model Information 


Data Set 

Dependent Variable 
Covariance Structure 
Subject Effect 
Estimation Method 
Residual Variance Method 
Fixed Effects SE Method 
Degrees of Freedom Method. 


WORK.HEART 
rate 

Unstructured 
person(drug) 
REML 
None 

Model-Based 

Between-Within 


Iteration History 


Iteration 


Evaluations -2 Res Log Like 


Criterion 


0 1 329.48905107 

1 1 278.84809316 0.00000000 


Estimated R Correlation Matrix for person(drug) 1 1 


Row Coll Col2 


Ccl3 


： o!4 


1.0000 

0.8498 

0.8889 

0.6246 


0.8498 

1.0000 

0.8697 

0.6315 


0.8889 

0.8697 

1.0000 

0.7945 


0.6246 

0.6315 

0.7945 

1.0000 


Covariance Parameter Estimates 


Cov Parm 

Subject 

Estimate 

UN(1,1) 

person(drug) 

37.2333 

UN(2,1) 

person(drug) 

34.3167 

UN (2,2) 

person(drug) 

43.8000 

UN(3, 1) 

person(drug) 

32.9333 

UN(3,2) 

person(drug) 

34.9500 

UN (3, 3) 

person(drug) 

36.8667 

UN(4,1) 

person(drug) 

21.5833 

UN(4,2) 

person(drug) 

23.6667 

UNM, 3) 

person(drug) 

27.3167 

UN(4,4) 

person(drug) 

32.0667 



Fit Statistics 


-2 Res Log Likelihood 

278.8 

AIC 

(smaller is better) 

298.8 

AICC 

(smaller is better) 

304.8 

BIC 

(smaller is better) 

305.9 


Null Model Likelihood Ratio Test 
DF Chi-Square Pr > ChiSq 


9 


50.64 


<.0001 
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Table 14.2 (Continued) 


Type 3 Tests of Fixed Effects 



Num 

Den 


Effect 

DF 

DF 

F Value 

drug 

2 

12 

1.35 

time 

3 

12 

12.35 

drug*time 

6 

12 

17.31 


CRD WITH 

REPEATED 

MEASURES 


PLOT OF MEAN HEART RATE OVER Til 



588 


CHAPTER 14. DESIGNS WITH REPEATED MEASURES 


(iv) The option “rcorr” in the input statements for all mixed model procedures results 
in printing out the correlation matrix for one person (subject). 

(v) For the CS method the correlation matrix is given by .22313 + ,776933’， that 
is, the correlation between two observations for the same person is estimated as 
r = .7769. This value is obtained as the intra-class correlation 



r = 

B e(A) 


where 

二 2 

a e(B) - 

=(MS[Person(drug)] — 

MS ⑽ /4 



二 (124.875 - 8.364)/4 

= 29.1278. 


(vii) The correlation between the observations at two adjacent time points is estimated 
as r = .8278. 

(viii) For the UN method the test results, again, are essentially the same as for the 
other analyses. 

(ix) The estimated correlation structure, in fact, shows that at least for the first three 
time points the correlations are almost equal, that is, exhibiting a CS structure. 

(x) Looking at the drug-time interaction plot it seems worthwhile to look at the in¬ 

dividual trends. To do that we fit linear, quadratic and cubic polynomials. The 
results show that there is a significant quadratic trend for drug 1 (P < .0001) 
and a significant cubic trend for drug 2 (P < .0001). The interaction is clearly 
not codirectional and hence it may not be appropriate to consider a test for the 
overall drug effects (see (ii) above). □ 

Example 14.3: The basic design for this pollution study is a SPD(CRD, RCBD) 
(see Section 13.4.1). We have a = 2 pollutants (P) as whole-plot treatments, r = 2 
replications, 6 = 4 split-plot treatments with a 2 2 factorial structure (two varieties, Vi, 
V 2 , and two growth enhancing treatments, Ai, A 2 ,). The pollutants are applied to four 
pots in a growth chamber (CH), each pot containing a plant from either Vi or V 2 treated 
with either A\ or A 2 such that all four combinations of (Vi, Aj) are represented in each 
growth chamber. Each pollutant is assigned randomly to two growth chambers. The 
four pots are randomly arranged in the growth chambers. Each plant is measured at 
three times, the measurements constituting the repeated measures. The data are given 
in Table 14.3a: 

(i) We include in the model statement the main effect P, V, A, TIME and all in¬ 
teractions among the corresponding factors up to three-factor interactions (as 
indicated by 3”). 
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Table 14.3 Repeated Measures in SPD(CRD, RCBD) 


a) Input statements: 
data pollutn; 

input PCH V A TIME PLANT y 
datalines; 

1 1 1 1 1 1 23 1 1 1 1 2 1 24 1 1 1 1 3 1 26 
1 1 1 2 1 2 20 1 1 1 2 2 2 24 1 1 1 2 3 2 25 
1 1 2 1 1 3 26 1 1 2 1 2 3 29 1 1 2 1 3 3 33 
112214 31 112224 35 112234 38 

1 2 1 1 1 5 25 1 2 1 1 2 5 26 1 2 1 1 3 5 27 
121216 30 121226 35 121236 36 
122117 28 122127 30 122137 34 
122218 32 122228 33 122238 36 
231119 40 231129 43 231139 45 
23 1 2 1 10 4423 1 22 10 47 23 1 23 1048 
2321111 48 2321211 50 2321311 55 

2 3 2 2 1 12 52 2 3 2 2 2 12 57 2 3 2 2 3 12 60 
2 4 1 1 1 13 45 2 4 1 1 2 13 47 2 4 1 1 3 13 50 
2 4 1 2 1 1445 24 I 22 144924 1 23 1452 
2 4 2 1 1 15 56 2 4 2 1 2 15 57 2 4 2 1 3 15 60 
24221 16 53 2422216 57 24223 16 59 

run; 

proc print data=poIIutn; 
title 1 ， POLLUTION DATA ’： 
run; 

proc mixed data=pollutn; 

class PCHV A TIME PLANT; 

model y=P|V|A{TIME @3/ddfm=satterth; 

random CH(P) V*A*CH(P); 

repeated/type=ar( 1) subject=PLANT(P*V*A) rcorr; 

lsmeans P V A TIME V*TIME A*TIME; 

title2 ’REPEATED MEASURES ANALYSIS'; 

title3 ’WITH AUTOCORRELATED ERRORS'; 

run; 
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Table 14.3 (Continued) 


b.) Output: 


POLLUTION DATA 


Obs P CH V A TIME PLANT y 

11 111 1 1 23 

2 1 111 2 1 24 

3 1 111 3 1 26 

41112 1 2 20 

5 1 1 1 2 2 2 24 

61112 3 2 25 

71121 1 3 26 

81121 2 3 29 

91121 3 3 33 

10 1 1 2 2 1 4 31 

11 1 1 2 2 2 4 35 

12 1 1 2 2 3 4 38 

13 1 2 11 1 5 25 

14 1 2 1 1 2 5 26 

15 1 2 1 1 3 5 27 

16 1 2 12 1 6 30 

17 1 2 1 2 2 6 35 

18 1 2 1 2 3 6 36 

19 1 2 2 1 1 7 28 

20 1 2 2 1 2 7 30 

21 1 2 2 1 3 7 34 

22 I 2 2 2 1 8 32 

23 1 2 2 2 2 8 33 

24 1 2 2 2 3 8 36 

25 2 3 1 1 1 9 40 

26 2 3 1 1 2 9 43 

27 2 3 1 1 3 9 45 

28 2 3 1 2 1 10 44 

29 2 3 1 2 2 10 47 

30 2 3 1 2 3 10 48 

31 2 3 2 1 1 11 48 

32 2 3 2 1 2 11 50 

33 2 3 2 1 3 11 55 

34 2 3 2 2 1 12 52 

35 2 3 2 2 2 12 57 

36 2 3 2 2 3 12 60 

37 2 411 1 13 45 

38 2 4 1 1 2 13 47 

39 2 4 1 1 3 13 50 

40 2 4 12 1 14 45 

41 2 4 1 2 2 14 49 

42 2 4 1 2 3 14 52 

43 * 2 421 1 15 56 

44 2 4 2 1 2 15 57 

45 2 4 2 1 3 15 60 

46 2 4 2 2 1 16 53 

47 2 4 2 2 2 16 57 

48 2 4 2 2 3 16 59 
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Table 14.3 {Continued) 


POLLUTION DATA 
REPEATED MEASURES ANALYSIS 
WITH AUTOCORRELATED ERRORS 

The Mixed Procedure 

Model Information 


Data Set 

Dependent Variable 
Covariance Structures 

Subject Effect 
Estimation Method 
Residual Variance Method 
Fixed Effects SE Method 
Degrees of Freedom Method 


WORK.POLLUTN 

y 

Variance Components, 
Autoregressive 
PLANT(P*V*A) 

REML 

Profile 

Model-Based 

Satterthwaite 


Class Level Information 


Class Levels Values 


P 

CH 

V 

A 

TIME 

PLANT 


2 12 

4 12 3 4 

2 12 

2 12 

3 12 3 

16 1 2 3 4 5 6 7 8 9 10 11 12 13 

14 15 16 


Iteration History 


Iteration 


Evaluations -2 Res Log Like 


Criterion 


3 

3 

1 

1 


157.97668907 

115.02581938 

114.57079107 

114.56527273 

114.56525763 


0.02422314 

0.00015995 

0.00000045 

0.00000000 


Convergence criteria met. 


Estimated R Correlation Matrix 
for PLANT(P*V*A) 1111 


Row 

Coil 

Col2 

CoI3 

1 

1.0000 

0.9355 

0.8751 

2 

0.9355 

1.0000 

0.9355 

3 

0.8751 

0.9355 

1.0000 


Covariance Parameter Estimates 


Cov Parm Subject 


Estimate 


CH(P) 
CH*V*A(P) 
AR ⑴ 
Residual 


3.5989 
1.66E-15 
0.9355 
8.8660 


PLANT(P*V*A) 
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Table 14.3 {Continued) 


Type 3 Tests of Fixed Effects 


Effect 


Num Den 

DF DF F Value Pr > F 


V 

PxV 

A 

P*A 

V*A 

P*V*A 

TIME 

P*IIME 

V*TIME 

P*V*TIME 

A*TIME 

P*A*IIME 

V-A^TIME 


1 2.01 8C.29 0.C120 
1 6.14 24.84 0.0023 
1 6.14 1.68 0.2414 
1 6.14 4.18 0.0857 
1 6.14 0.24 0.6413 
1 6.14 0.01 0.9339 

1 6.07 0.17 C.6908 

2 18 104.55 <.0001 
2 18 0.51 0.6102 
2 18 4.62 0.0240 
2 18 0.70 0.5088 
2 18 7.30 0.0048 
2 18 0.05 0.9486 
2 IS 0.51 0.6102 


Least Squares Means 


Standard 


Effec~ 

P 

V 

A 

TIME 

Estimate 

Error 

DF 

t Value 

P 

1 




29.4167 

1.6368 

2 .Cl 

17.44 

P 

2 




50.7917 

1.6868 

2.01 

30.11 

V 


i 



36.5000 

1.3948 

3.6 

26.17 

V 


2 



43.7083 

1.3948 

3.6 

31.34 

A 



1 


38.6250 

1.3948 

3.6 

27.69 

A 



2 


41.5833 

1.3948 

3.6 

29.8 ： 

TIME 




1 

37.3750 

1.2058 

2.1 

31.00 

TIME 




2 

4C.1875 

1.2058 

2.1 

33.33 

TIME 




3 

42.7500 

1.2058 

2.1 

35.45 

V*TIME 


1 


1 

34.0000 

1.4170 

3.83 

23.99 

V*7IME 


1 


2 

36.8750 

1.417C 

3.83 

26.C2 

V*TIME 


1 


3 

38.6250 

1.4170 

3.83 

27.26 

V-TIME 


2 


1 

40.7500 

1.4170 

3.83 

28.76 

V*TIME 


2 


2 

43.5C00 

1.4170 

3.83 

30.7C 

V*TIME 


2 


3 

46.8750 

1.417C 

3.83 

33.08 

A*7IME 



1 

i 

36.3750 

1.4170 

3.83 

25.67 

A^TIME 



1 

2 

38.2500 

1.4170 

3.83 

26.99 

A^TIME 



1 

3 

41.2500 

1.4170 

3.83 

29.11 

A^TIME 



2 

1 

38.375C 

1.4170 

3.83 

27.08 

A*TIXE 



2 

2 

42.1250 

1.4170 

3.83 

29.73 

A*TIME 



2 

3 

44.2500 

1.4170 

3.83 

31.23 


r > I r 1 

0.0032 

0.0011 

<.0001 

<.0001 

<.C00 ： 

<.0001 

0.0008 

0.0007 

0.0006 

<.0001 

<.00C1 

<.0001 

<.0001 

<.0001 

<_0C01 

<.0001 

<.00C1 

<.0001 

<.0001 

<.0001 

<.000 
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(ii) We use the option “ddfm=satterth，’ to determine appropriate d.f. 

(iii) The random terms “CH(P)” and “V*A*CH(P)” describe in technical terms the 
errors E(A) and E(B), respectively, where V*A*CH(P) includes V*CH(P) and 
A*CH(P). 

(iv) We specify AR(1) as the covariance structure for the repeated measures. 

(v) “PLANT(P*V*A)’’ specifies the subject for the repeated measures. 

The results of the analysis are given in Table 14.3b: 

(vi) The correlation between adjacent observations is obtained as r 二 .9355. 

(vii) The estimates of the variance components are obtained as 

CH(P)=3.60, V*A*CH(P) = 0, Residual = 8.87, respectively. 

(viii) Since with the inclusion of TIME, that is, repeated measures in the design (see 
Section 13.6)，there will be three different error terms for testing hypotheses 
about fixed effects: E(A) for P with 2 d.f, E(B) for V ， A ， V*A, P*V, P*A, P*V*A 
with 6 d.f. (determined by SAS to be 6.14 or 6.07), and E(C) for all terms involv¬ 
ing TIME with 18 d.f, (these include 2 d.f. from the P*V*A*TIME interaction) 

(ix) P,V,A, TIME, V*TIME, A*TIME are found to be significant. 

(x) A look at the LS means indicate that the significant interactions are co-directional. 

Hence, testing hypotheses about main effects is meaningful. □ 

14.4 EXERCISES 

1. Using the data from Example 14.3 perform the trend analysis as described in 
Section 14.2.3 and compare the results with those obtained in Table 14.2b. 

2. Using the data from Example 14.3 perform the analysis by making comparison 
at each time point. Compare the results with those obtained in Exercise 1 and 
those in Table 14.1b. 

3. Using the data from Example 14.3 perform the mixed model analysis using the 
assumption of compound symmetry. Compare the results to those obtained in 
Table 14.3b. 

4. Consider a CRD with subsampling and repeated measures for each subsample. 
Discuss how you would analyze the data and how you would perform the analy¬ 
sis using SAS PROC MIXED. 

5. Consider an RCBD with repeated measures. Discuss how you would analyze the 
data and how you would perform the analysis using SAS PROC MIXED. 
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Epilogue 


Let us now return to the conversation between the statistician (S) and the research sci¬ 
entist (R) described in the Preface. 

Several weeks after this conversation R and his research assistant (RA) pay a visit 
to S who also has one of his graduate students (GS) in his office. 

R: “Thank you for taking the time to talk to us. Our paper has been tentatively ac¬ 
cepted for publication, but the editor asked, among other things, for some clarification 
on our experimental design and the analysis. And this is where we hope you can help 
us. Since RA has done most of the work I’ll let her tell you what we did. ” 

RA: “I made five separate preparations for each of the three types of growth medium. 
Each preparation contained enough material to fill four pots. On a bench in the green¬ 
house I arranged 15 rows of 4 pots each. I randomly assigned the 15 preparations to 
the 15 rows of pots and filled each pot in a row with the assigned growth medium., In 
each row I then planted one flower from each of the four varieties in a separate pot. 
The plants were randomized separately in each row.” 

S: “That is very good. What can you tell me about the environmental conditions 
in the greenhouse. For example, is the amount of light different at both ends of the 
bench?” 

RA: “I understand what you mean, but the bench is arranged such that the light 
and temperature conditions are uniform over the entire bench. Also, all the plants were 
treated identically. For example, they all received the same amount of water，all at the 
same times. So, there should be no environmental differences.” 

S: “Fine. Now, for the analysis, what kind of data do you have?” 

R: “To evaluate the effectiveness of the growth media we developed an index which 
combined various aspects of growth, such as height, development of foliage, formation 
of buds and flowers. We made these observations every two weeks for 12 weeks. For 
the publication we reported only the results for week 12, because that is most important 
from an economical point of view, that is, from the producer’s point of view. From a 
scientific point of view it would also be informative to analyze the entire data set over 
the course of the 12 weeks, that is, using the 6 measurements (indexes) that we have.” 

S: “I see. GS has just completed a course on experimental design. I’ll ask him to 
help you with the analysis of the data, which will be different from the one you have 
performed, because the design is different from the one on which you have based your 
analysis. He can then help you with the interpretation of the results and explain to what 
extent they may differ from what you presented in the paper. He can also help you with 
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the analysis of the 12 weeks data.” 

GS: “Yes, I know how to do that.” 

R and RA: “Thank you very much for your help.” 

And you，the reader, are being challenged to consider this as an additional exercise 
and prepare a report on the type of design used in this experiment and how the analyses 
of the data should be performed. 
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73, 151,497, 523 
confirmatory, 29-30 
design of, 16, 20-22, 26, 29 
exploratory, 29-30, 475 
factorial, 59, 64, 419-422, 533 
asymmetrical, 64 
fractional, see Factorial(s) 
symmetrical, 64 
investigative, 144 
Lady tasting tea, 139-140 
mixture, 519, 523 
noisy, 142 
psychological, 545 
randomized, 145 

replicated randomized block, 307 
triangular, 140 
types of, 23 
Experimentation 

industrial, 30, 420, 497, 511, 543 
scientific, 30, 46 
sequential, 43 
Explanation, 18 

Factor (s) 

bet ween- subj ects, 545 
blocking, 32, 35, 106, 278, 306, 
313, 373,440, 580 


crossed, 308 
nested, 308 
classification, 32 
confounded, 101 
correction, 94 
crossed, 101 

easy-to-change, 511-512, 543 
efficiency, 333, 336 
hard-to-change, 511-513, 524, 543 
intrinsic, 35, 38-42,45, 51-53, 56, 
106, 134, 278, 313-314, 325, 
373, 440, 552 
level, 54 
nested, 106 

nonspecific, 35, 38-42, 45, 56, 134, 
278, 373 
qualitative, 52 
quantitative, 52 
split-plot, 534 

treatment, 32-35, 42, 51-53, 422 
whole-plot, 534 
within-subjects, 545 
Factorial(s), see also Design, 
Experiment(s) 
asymmetrical, 66, 476, 479 
complete, 511 

fractional, 64-66, 453-455, 462-, 
463, 472, 475, 479, 505-506 
of resolution III, 503 
of resolution IV, 503 
of resolution V ， 503, 511 

full, 503 

highly fractionated, 475 
mixed, 476, 548 
pure, 476 

symmetrical, 66, 476 
2 n , 422, 446, 462, 503, 509 
3 n , 465, 472, 505, 509 
Faraday, 13 
Fit 

badness of, 76 
lack of, 223, 502, 505, 524 
Frequency(ies) 

proportional, 96 
relative, 10 

Frequentist approach, 123 
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Function(s) 

estimable ， 78, 81 ， 125, 131 ， 242, 
459, 557 
identifiable, 81 
likelihood, 26 
linear, 130 
parametric, 137 
polynomial, 498 
quadratic, 130, 166 

Games of chance, 25 
Gauss-Markov 

linear model (GMLM), 124-125 
normal linear model (GMNLM), 
128-131, 137, 147 
properties, 147 
theorem, 124 

Half-normal plot, 441-443 
Heisenberg uncertainty principle, 2, 11 
Heraclitus, 12 
Heterogeneity, 239 

elimination of, 395 
of experimental units, 160 
Homogeneity, 193 
of groups, 229 
Hume, 9 

Huynh-Feldt condition, 545, 577 
Hypothesis, 6-7 

falsification of, 7 
reductionist, 14 
research, 32, 41-44, 53 
statistical, 32, 43 
working, 32 

Identifiability, 76, 114, 120 
Induction, 4, 6-7 
Inequality 

Bonferroni, 225 
Tchebycheff，139 
Inference, 16 

Bayesian, 26 

statistical, 24, 57, 122, 151 
types of ， 7, 36 

Information 

inter-block, 134, 553 


loss of, 478 

supplementary, 59, 239-242, 248, 
292 

Interaction 輪 419, 470, 475 
antidirectional, 319 
antagonistic, 319 

block-treatment, 278, 300-302, 306 
-308, 312-314, 317-319, 338 
codirectional, 313, 319 
components, 468-470, 475 
effects, see Effects, interaction 
first-order，421 

generalized, 449, 456, 474-475 
higher order, 420-421, 428 
linear x linear, 505 
lower order, 420 
plot, 320-321 

replication x treatment, 391-392 
row-column, 379 
simple, 424 
synergistic，313 
three-factor, 457, 471， 504 
treatment x design, 134 
treatment-time, 575, 580 
two-factor, 421, 456, 471, 504 
unit-treatment, 300-301, 314 
Interval(s) 

confidence, 27, 217 

estimation, 37 
simultaneous, 226-227 
statistical, 137 

Intervention, see Studies, intervention 

Jeffreys, 26 

Kant, 9, 17 
Keeton, 27 
Kepler, 4, 13 

Knut Vik square, 149-150 

Lagrange multipliers, 87 
Latin rectangle, 393-394 
Latin square(s), 62, 376, 548 

completely counterbalanced, 402 
complete orthogonalized, 397 
cyclic, 402 
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design (LSD), see Design, Latin 
square 

diagram-balanced, 547 
Graeco-, 396 
incomplete, 394, 402 
mutually orthogonal (MOLS), 397, 
403 。 

orthogonal, 395 
principle, 62-64, 390, 393 
reduced, 376 
Lavoisier, 13 
Law(s) 

Kepler’s，4 
Mendel’s，4 
of succession, 25 
Least squares 

analysis, 293-297, 335, 500 
fitting, 76, 80, 86, 119 
generalized (GLS) ， 126, 512 
mean (LSM), 244, 325-326, 

332, 422 

method of ， 37, 57, 77, 220-221 ， 
242, 466 

ordinary (OLS), 126, 512 
Leucippus, 132 
Level(s) 

coded, 500 
equidistant, 219 

significance, 172-173, 177-179,183 
Likelihood, 11 
function, 26 

residual maximum (REML), 580 
Linear model, 37-38, 44-46, 71-73 
affine, 85-86 
approximative, 77 
classificatory ， 74-75 
conditional, 85, 99 
derived, 68, 127 ， 159 ， 164, 278, 
314,537 
functional, 74 
Gaussian, 26 

Gauss-Markov (GMLN), 124-125 
128-131, 137, 147 
k-part, 97 
ordered, 90-94 
stochastic, 74, 77, 123 


theory, 71 

2- part, 90 

3- part, 94, 329 
Locke, 9 

Logic, Aristotelian, 9 
Loss 

of degrees of freedom, 290 
of information, 253 
of power, 290 
of sensitivity, 290 

Mathematics, foundations of, 6 
Matrix 

design, 506 

design-model, 466, 504 
generalized inverse, 81-83,125, 332 
idempotent symmetric (s.i.p), 78, 
84,87 " 

incidence, 118, 329, 333, 339, 509 
information, 330 
model，73 

Moore-Penrose (M-P) inverse, 
84-86, 124 
orthogonal, 126, 216 
projection, 91 

variance-covariance, 124-125, 578 
Maxwell, 13 

Mean, admissible, 107-108, 115 
Mean square(s) 

expected, 168 
synthetic error, 326-327 
Measure(s), 

repeated, 23, 573>574, 578 
summary, 575 
Measurement(s) 

process, 10, 22-24 
repeated, 23-24, 573 
scale of, 34,197 
variability, 24 
Mendel, 4 
Method(s) 

ANOVA, 580 
delta, 198 

of parallel tangents (PARTAN), 519 
of statistical differentials, 198 
of steepest ascent, 518 
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Mill, 18 

Model(s), see also Linear model 
approximate, 71， 74 
classificatory, 34, 74 
conditioned, 85-87 
first-order，500 
fitting a, 76 
fixed, 132-133, 323 
full, 430 
means, 99, 114 
misspecification, 513 
mixed, 132-134, 325 
multiplicative, 303 
nonlinear, 34 
nonorthogonal, 332 
overparameterized, 100, 115 
partitioned, 94 
polynomial, 519 
probability, 10 
random, 132-133, 325 
randomization, 159 
regression, 34 

first-order, 500 
second-order, 504 
relative frequency, 10 
statistical, 30, 34 

testing of, 7 
stochastic, 74, 128, 138 
subject matter, 32, 35, 51, 54 
subsampling, 191 
three-part, 94 

two-way classification, 329 
well-formulated, 109-110, 115 

Monte Carlo studies, 178 

Multicollinearity, 74-75 

Newton, 17 

Nonadditivity, 196, 300-302, 312, 386- 
387 

testing for, 303 

Noncentrality parameter, 131-132, 183 ， 
195 

Nonorthogonality, 400 

Normality assumption, 257 

Observation(s) 


adjusted, 241 
high-leverage, 258 
missing, 55, 295-298, 389 
estimated, 297 
multivariate, 34 
process, 1-2, 10 
supplementary, 258 
types of, 3 
univariate, 34 
validation of, 2 
Optimality, 59 
59 
D-, 59 
E-. 59 

Orthogonal array, 463-464 
Orthogonality, 59, 218, 400-402 

Period, 

extra, 400 
pre-，400 
wash-out, 398 

Plan 

main effect, 455, 480 

orthogonal, 462-463 
saturated, 464 

Plato, 12-13 
Plot(s) 

half-normal, 441-443 
interaction, 320-321 
split, 534, 537, 548, 556 
split-split, 560 
whole, 534, 537, 548, 556 
Poincare, 8, 25 
Points, 

axial, 506-508 
center, 506-508 
factorial, 506-508 
Polynomial(s) 

canonical, 521-522 
first-order, 499-500 
low-order, 499 

orthogonal, 220-222, 342, 441 ， 
505, 576 

Tchebycheff, 221 
Popper, 7-8 
Population, 
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marginal mean, 245 
reference, 323 
target, 32, 60 
Power 

explanatory 15 
of F-test, 182 
loss of, 290 
transformation, 200 
Precision 

increase in, 253 
of treatment comparisons, 278 
Predictive margin, 245 
Principle(s) 

of blocking, 34 
of experimentation, 29 
of indifference, 25 
Latin square, 62-64, 390, 393 
split-unit, 533 
Probability, 141 

conditional, 25 
continuous, 10 
degrees of, 25 
frequency theory of, 25-26 
joint, 24 
structure, 25 
theory, 72 
Procedure(s) 

Bonferroni, 225, 228 
Calinski-Corsten, 231 
Dunnett ? s, 228 

hierarchical agglomerative, 229 
hypothesis falsification, 7 
Johnson-Gray bill, 306 
Mandel’s, 302 

multiple comparison, 224, 250-251 
nonparametric, 228 
optimization, 518 
Satterthwaite, 326, 561 
Scheffe, 227-228 
stepwise, 229 

studentized range, 226, 229 
Tukey, 226, 252 
Tukey-Kramer, 226, 252, 269 
Process 

control, 30 
evolutionary, 1 


manufacturing. 24 
measurement, 10, 34 
observational, 1-2, 10 
production, 30 

randomization, see Randomization, 
process 
of science, 1 
sequential, 30 
stochastic, 26, 72 
Projection(s) 
matrix, 91 
orthogonal，91 
Projector ， orthogonal, 91 
Protocol 

experimental, 55, 139 
measurement, 10 
observation, 2 
Pythagoras, 12 

Quadratic form, 128 
Quality control, off-line, 477 

Ramsey, 26 

Randomization, 26, 34, 45, 55-56, 61, 
106, 137, 140-141, 147-151 ， 278 ， 
376 

analysis, 68-69, 180 
distribution, 26 
independent, 534, 538 
procedures, 27, 68, 111, 156-157, 
180, 377, 380, 533, 548 
process, 154, 157, 164, 171, 280-, 
281,315, 533, 537 
repetitions of, 158 
restricted, 280, 291 
test, 26, 69, 150,172-173, 180, 285, 
385 

approximation to the, 69 ， 
173-174, 193,217, 
248,286-288,317, 
538 

theory, 134, 177, 287, 303, 382 
unrestricted, 291 
Random numbers, 154 
Random variable(s), 22-24 
Bernoulli, 155 
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design, 68, 154-155, 158, 280, 379, 
537 

Gaussian, 22 
normal, 129 

multivariate, 129 
Range of validity, 60 
Region 

experimental (ER), 498, 503 
operational (OR), 498 
Regression 

analysis, 220, 497, 502 
coefficient, 221, 304, 466-467, 505, 
528 
line, 302 
polynomial, 262 
second-order linear, 504 
Relation, see Relationship 
Relationship 

defining, 455-456 
functional, 498 
identity, 455, 475, 479 
Reparameterization, 81 
Repetition(s), 139 

population of, 137-138 
Replication(s), 45, 61 
fractional, 451-453 
number of 180, 184, 186-190, 193- 
195 

effective, 451 
unequal, 179-180 
Residual, see Error 
Response(s) 

conceptual, 157-158,161, 281, 315, 
379, 537 
curve, 497, 514 
observed, 315, 379 
optimum, 499, 503 
predicted, 501 
Response surface, 497-500 
design, 497-499 
first-order, 503 

methodology (RSM), 497-499, 519, 
523 

second-order, 509 
Rightmost bracket, 107, 111 


Rotatability, 506 

Sartre, 9 
SAS, 69 

PROC FACTEX, 451, 459,486-491 
PROC GLM, 201 ， 230-232, 264, 
269, 343, 348, 353, 404, 407, 

430, 443, 446, 481,523, 562, 

580 

PROC IML, 86 

PROC MIXED, 201, 343, 348, 353, 
483, 523, 562, 568, 578-580 
PROC PLAN, 154, 180, 278-279 ， 
315-316, 377-378, 481 
PROC POWER, 185 
PROC REG, 523 
PROC RSREG ， 523-524 
Science(s) 

descriptive, 9 
exact, 12 
general, 14 
history, 5 
physical, 14 
process of, 1 
type of, 9 

Scientific objective, 58 
Scope of validity, 277 
Sensitivity of experiment, 45 
Simplex 

coordinate system, 519 
design, 504 

^-dimensional, 504, 519 
Slope(s) 

common, 253 
equality of, 269 
Socrates, 6, 12 
Space 

column, 91 
error, 129 
estimation, 129 
row, 76 

Statistical Analysis System, see SAS 
Statistical software, see SAS 
Statistical triangle, 46 
Structure(s) 

alias, 454-455, 473-475 
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blocking, 45 

nested, 353, 428, 440 
classificatory, 100, 106 

balanced, 100-101 
correlation, 577 

estimated, 588 

covariance, 127,160, 164,168, 545, 
575, 578 

compound symmetry, 577- 
580 

first-order autoregressive, 
578-580 

spatial power, 579 
unstructured, 579-580 
data, 100 

balanced, 107-108, 111- 

112 , 

classificatory ， 99, 118 
unbalanced, 112 
diagram(s), 110-111 
error, 56 

factor balanced, 112 
factorial, 42, 64, 106, 419-421 ， 

440, 543, 552. See also 
Factorial(s) 

asymmetrical, 64 
symmetrical, 64 
Latin square, 380 
variance-covariance, 127 
Studies, see also Experiment(s) 

experimental, 104-106, 138, 149, 
intervention, 106, 137-138,149 
observational, 104-106, 134, 137- 
138, 149 

preliminary, 185, 191 
simulation, 177, 286. See also 
Monte Carlo studies 
Subject matter knowledge, 328, 422 
Subsample, size of, 195 
Subsampling, 34, 40, 67-68, 191-193, 

288, 353 

Sub-subsampling, 67-68 
Sum(s) of squares, 37, 426 
partial, 98, 550 
sequential, 98 
Type I, 98, 330 


Type III， 98 
Syllogism, 5-6 
basic, 5-6 

Symmetry, compound, 577-580 
Synergism, 43 

Taylor series expansion, 421 
Test(s) 

Bonferroni, 225 
criterion, 151, 172 
Duncans multiple range, 226, 251 
F-, 151, 174, 177, 217, 285, 502, 
538 

power of, 182 
Fishers protected LSD, 225 
F-max, 323 

of hypotheses, 7, 37, 57, 131, 171 
lack-of-fit, 223,502-505 
preliminary, 313, 323 
randomization, 69, 177, 285. See 
also Randomization, test 
randomized triangular, 140 
significance, 7, 37, 148-151, 165, 
542 

size of, 183 
statistical, 137 
studentized range, 226 
Mike, 257 
treatment, 226-227 
Tukey’s ， 303-305, 388 
Tetrahedron, 504 
Thales of Miletus, 12 
Theorem 

Aitken, 125-127 
central limit, 148 
Gauss-Markov, 125-126 
Theory 

axiom, 13 

development of, 4, 10 
falsifiable, 14 

Gauss-Markov normal linear model 
(GMNLM), 174-176 
mathematical, 11 
normal, 285-286 
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randomization, 176-177 ， 285-286. 
See also Randomization, 
theory 
scientific，8 
statistical, 11 
types of, 11 
Time series, 17 

Transformation(s), 196-199, 312 
to additivity, 388 
power, 200 
sets, 376-377, 384 
Treatment(s) 

combinations, 420 
control, 227 

design, see Design, treatment 

factorial, 64 

mean, 

adjusted，244 
qualitative, 52, 213, 497 
quantitative, 52, 219, 497 
split-plot, 539-540, 543, 560 
effect, 539 

split-split-plot, 560, 561 
test, 227 

whole-plot, 539-540, 543, 560 
effect, 539-540 

Trend, 340 

analysis, 575 
linear, 223, 341-343 
overall, 577 

Trial 

agronomic, 278 
Bernoulli, 25 
binomial, 24 

randomized clinical, 32, 35 
uniformity, 290-291, 543 
Triangle, equilateral, 504 


Tycho Brahe, 13 

Unbiasedness, 59 
Unit(s) 

error, 160 

experimental (EU), 20, 34, 38, 

68, 138,153, 533 
observational (OU), 34, 38, 68 
sampling, 68 

Variability, see Variation 
Variable(s) 

classificatory, 118 
coded, 514 

concomitant, 15, 76, 118 
explanatory, 35-38, 74, 151 
function of, 71 
mathematical, 11-12, 22 
process, 523 

random, see Random variable(s) 
regressor, 302 
response, 35-36, 52 
Variance(s)，514 

average, 251-252, 336 
estimator, 248, 562 
experimental error, 317, 557 

component, 163, 193,288 
nonconstancy of, 197 
observational error, 317 

component, 163, 193, 288 
prediction, 506 
Variation, 26 

induced, 277 
random, 38, 239 

sources of, see Analysis of variance, 
table 

systematic, 45, 62, 239 
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